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Introduction 


A popular buzzword has been “conver- 
gence”—the intersection of various technolo- 
gies that were previously unrelated. One of 
the key elements of multimedia convergence 
in the home and business has been video. 

A few short years ago, the applications for 
video were somewhat confined—analog broad- 
cast and cable television, analog VCRs, analog 
settop boxes with limited functionality, and 
simple analog video capture for the personal 
computer (PC). Since then, there has been a 
tremendous and rapid conversion to digital 
video, mostly based on the MPEG and DV 
(Digital Video) standards. 

Today we have: 


¢ DVD and SuperVCD Players and Record- 
ers. An entire movie can be stored digi- 
tally on a single disc. Although early 
systems supported composite and s- 
video, they rapidly added component 
video connections for higher video qual- 
ity. The latest designs already support 
progressive scan capability, pushing the 
video quality level even higher. 


Digital VCRs and Camcorders. DVCRs 
that store digital audio and video on tape 
are now common. Many include an 
IEEE 1394 interface to allow the trans- 
fer of audio and video digitally in order 
to maintain the high quality video and 
audio. 


Digital Settop Boxes. These interface the 
television to the digital cable, satellite, 
or broadcast system. In addition, many 
now also provide support for interactiv- 
ity, datacasting, sophisticated graphics, 
and internet access. Many will include 
DVI and IEEE 1394 interfaces to allow 
the transfer of audio, video, and data 
digitally. 


Digital Televisions (DTV). These 
receive and display digital television 
broadcasts, either via cable, satellite, or 
over-the-air. Both standard-definition 
(SDTV) and high-definition (HDTV) 
versions are available. 
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© Game Consoles. Powerful processing 
and graphics provide realism, with the 
newest systems supporting DVD play- 
back and internet access. 


e Video Editing on the Personal Computer. 
Continually increasing processing 
power allows sophisticated video edit- 
ing, real-time MPEG decoding, fast 
MPEG encoding, etc. 


e Digital Transmission of Content. This 
has now started for broadcast, cable, 
and satellite systems. The conversion to 
HDTV has started, although many 
countries are pursuing SDTV, upgrad- 
ing to HDTV at a later date. 


Of course, there are multiple HDTV and 
SDTV standards, with the two major differ- 
ences being the USA-based ATSC (Advanced 
Television Systems Committee) and the Euro- 
pean-based DVB (Digital Video Broadcast). 
Each has minor variations that is unique to 
each country's requirements regarding band- 
width allocation, channel spacing, receiving 
distance, etc. 

Adding to this complexity is the ability to 
support: 


© Captioning, Teletext, and V-Chip. With 
the introduction of digital transmission, 
the closed captioning, teletext, and vio- 
lence blocking (“V-chip”) standards had 
to be redefined. 


e Interactivity. This new capability allows 
television viewers to respond in real- 
time to advertisements and programs. 
Example applications are ordering an 
item that is being advertised or playing 
along with a game show contestant. 


¢ Datacasting. This new technology trans- 
mits data, such as the statistics of the 
pitcher during a baseball game, stock 
market quotes, software program 
updates, etc. Although datacasting has 
been implemented using analog teletext 
capability, digital implementations are 
able to transfer much more data in 
much less time. 


e Electronic Program Guides. EPGs are 
moving from being simple scrolling dis- 
plays to sophisticated programs that 
learn your viewing habits, suggest pro- 
grams, and automatically record pro- 
grams to a hard drive for later viewing. 


In addition to the MPEG and DV stan- 
dards, there are several standards for transfer- 
ring digital video between equipment. They 
promise much higher video quality by elimi- 
nating the digital-to-analog and analog-to-digi- 
tal conversions needed for analog interfaces. 


e IEEE 1394. This high-speed network 
enables transferring real-time com- 
pressed, copy-protected digital video 
between equipment. It has been popular 
on digital camcorders for the last few 
years. 


e DVI. The Digital Visual Interface allows 
the transfer for real-time uncompressed, 
copy-protected digital video between 
equipment. Originally developed for 
PCs, it is applicable to any device that 
needs to interface to a display. 


e USB. The 480 Mbps version of Univer- 
sal Serial Bus enables transferring real- 
time uncompressed, copy-protected dig- 
ital video between equipment. 
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Of course, in the middle of all of this is the 
internet, capable of transferring compressed 
digital video and audio around the world to any 
user at any time. 

This third edition of Video Demystified has 
been updated to reflect these changing times. 
Implementing “real-world” video is not easy, 
and many engineers have little knowledge or 
experience in this area. This book is a guide 
for those engineers charged with the task of 
understanding and implementing video fea- 
tures into next-generation designs. 

This book can be used by engineers who 
need or desire to learn about video, VLSI 
design engineers working on new video prod- 
ucts, or anyone who wants to evaluate or sim- 
ply know more about video systems. 


Contents 


The remainder of the book is organized as fol- 
lows: 

Chapter 2, an introduction to video, dis- 
cusses the various video formats and signals, 
where they are used, and the differences 
between interlaced and progressive video. 
Block diagrams of DVD players and digital set- 
top boxes are provided. 

Chapter 3 reviews the common color 
spaces, how they are mathematically related, 
and when a specific color space is used. Color 
spaces reviewed include RGB, YUV, YIQ, 
YCbCr, HSI, HSV, and HLS. Considerations for 
converting from a non-RGB to a RGB color 
space and gamma correction are also dis- 
cussed. 

Chapter 4 is a video signals overview that 
reviews the video timing, analog representa- 
tion, and digital representation of various video 
formats, including 480i, 480p, 576i, 576p, 720p, 
1080i, and 1080p. 


Chapter 5 discusses the analog video inter- 
faces, including the analog RGB, YPbPr, s- 
video, and SCART interfaces for SDTV and 
HDTV consumer and pro-video applications. 

Chapter 6 discusses the various parallel 
and serial digital video interfaces for semicon- 
ductors, pro-video equipment, and consumer 
SDTV and HDTV equipment. Reviews the 
BT.656, VMI, VIP. and ZV Port semiconductor 
interfaces, the SDI, SDTI and HD-SDTI pro- 
video interfaces, and the DVI, DFP OpenLDI, 
GVIFK and JEEE 1394 consumer interfaces. 
Also reviewed are the formats for digital audio, 
timecode, error correction, etc. for transmis- 
sion over various digital interfaces. 

Chapter 7 covers several digital video pro- 
cessing requirements such as 4:4:4 to 4:2:2 
YCbCr, YCbCr digital filter templates, scaling, 
interlaced/noninterlaced conversion, scan rate 
conversion (also called frame-rate, field-rate, or 
temporal-rate conversion), alpha mixing, 
flicker filtering, chroma keying, and DCT- 
based video compression. Brightness, con- 
trast, saturation, hue, and sharpness controls 
are also discussed. 

Chapter 8 provides an NTSC, PAL, and 
SECAM overview. The various composite ana- 
log video signal formats are reviewed, along 
with video test signals. VBI data discussed 
includes timecode (VITC and LTC), closed 
captioning and extended data services (XDS), 
widescreen signaling (WSS), and teletext. In 
addition, PALplus, RF modulation, BTSC and 
Zweiton analog stereo audio, and NICAM 728 
digital stereo audio are reviewed. 

Chapter 9 covers digital techniques used 
for the encoding and decoding of NTSC and 
PAL color video signals. Also reviewed are var- 
ious luma/chroma (Y/C) separation tech- 
niques and their trade-offs. 

Chapter 10 discusses the H.261 and H.263 
video compression standards used for video 
teleconferencing. 
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Chapter 11 discusses the Consumer DV 
digital video compression standards used by 
digital VCRs and digital camcorders. 

Chapter 12 reviews the MPEG 1 video 
compression standard. 

Chapter 13 discusses the MPEG 2 video 
compression standard used by DVD, SVCD, 
and DTV. 

Chapter 14 is a Digital Television (DTV) 
overview, discussing the ATSC and DVB 
SDTV and HDTV standards. 

Finally, a glossary of over 400 video terms 
has been included for reference. If you encoun- 
ter an unfamiliar term, it likely will be defined 
in the glossary. 


Organization Addresses 


Many standards organizations, some of which 
are listed below, are involved in specifying 
video standards. 


Advanced Television Systems Committee (ATSC) 
1750 K Street NW 
Suite 1200 
Washington, DC 20006 
Tel: (202) 828-3130 
Fax: (202) 828-3131 
http://www.atsc.org/ 


Digital Video Broadcasting (DVB) 
17a Ancienne Route 
CH-1218 Grand Sacconnex 
Geneva, 
Switzerland 
Tel: +41 22 717 27 19 
Fax: +41 22 717 27 27 
http://www.dvb.org/ 


European Broadcasting Union (EBU) 
Ancienne route 17A 
CH-1218 Grand-Saconnex GE 
Switzerland 
Tel: +41-22-717-2111 
Fax: +41-22-717-4000 
http://www.ebu.ch/ 


Electronic Industries Alliance (ETA) 
2500 Wilson Boulevard 
Arlington, Virginia 22201 
Tel: (703) 907-7500 
Fax: (703) 907-7501 
http://www.eia.org/ 


European Telecommunications Standards 
Institute (ETSI) 
650, route des Lucioles 
06921 Sophia Antipolis, France 
Tel: +33 4 92 94 42 00 
Fax: +33 4 93 65 47 16 
http://www.etsi.org/ 


International Electrotechnical Commission (IEC) 
3, rue de Varembé 
PO. Box 131 
CH - 1211 GENEVA 20 
Switzerland 
Tel: +41 22 919 02 11 
Fax: +41 22 919 03 00 
http://www.iec.ch/ 


Institute of Electrical and Electronics Engineers 
(IEEE) 

1828 L Street, N.W., Suite 1202 

Washington, D.C. 20036 

Tel: (202) 785-0017 

http://www.ieee.org/ 
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International Telecommunication Union (ITU) 
Place des Nations 
CH-1211 Geneva 20 
Switzerland 
Tel: +41 22 730 5111 
Fax: +41 22 733 7256 
http://www.itu.int/ 


Society of Cable Telecommunications Engineers 
(SCTE) 

140 Philips Road 

Exton, PA 19341 

Tel: (800) 542-5040 

Fax: (610) 363-5898 

http://www.scte.org/ 


Society of Motion Picture and Television 
Engineers (SMPTE) 
595 West Hartsdale Avenue 
White Plains, New York 10607 USA 
Tel: (914) 761-1100 
Fax: (914) 761-3115 
http://www.smpte.org/ 


Video Electronics Standards Association (VESA) 
920 Hillview Ct., Suite 140 
Milpitas, CA 95035 
Tel: (408) 957-9270 
http://www.vesa.org/ 


Video Demystified Web Site 


At the Video Demystified web site, you'll find 
links to chip, PC add-in board, system, and 
software companies that offer video products. 
Links to related on-line periodicals, news- 
groups, standards, standards organizations, 
associations, and books are also available. 


http://www.video-demystified.com/ 


Introduction 
to Video 


Although there are many variations and imple- 
mentation techniques, video signals are just a 
way of transferring visual information from 
one point to another. The information may be 
from a VCR, DVD player, a channel on the local 
broadcast, cable television, or satellite system, 
the internet, game console, or one of many 
other sources. 

Invariably, the video information must be 
transferred from one device to another. It 
could be from a settop box or DVD player toa 
television. Or it could be from one chip to 
another inside a settop box or television. 
Although it seems simple, there are many dif- 
ferent requirements, and therefore, many dif- 
ferent ways of doing it. 


Analog vs. Digital 


Until recently, most video equipment was 
designed primarily for analog video. Digital 
video was confined to professional applica- 
tions, such as video editing. 


The average consumer now has access to digi- 
tal video thanks to continuing falling costs. 
This trend has led to the development of DVD 
players, digital settop boxes, digital television 
(DTV), and the ability to use the internet for 
transferring video data. 


Video Data 


Initially, video contained only analog gray-scale 
(also called black-and-white) information. 
While color broadcasts were being devel- 
oped, attempts were made to transmit color 
video using analog RGB (red, green, blue) 
data. However, this technique occupied 3¥ 
more bandwidth than the current gray-scale 
solution, so alternate methods were developed 
that led to using YIQ or YUV data to represent 
color information. A technique was then devel- 
oped to transmit this analog YIQ or YUV infor- 
mation using one signal, instead of three 
separate signals, and in the same bandwidth as 
the original gray-scale video signal. This com- 
posite video signal is what the NTSC, PAL, and 
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SECAM video standards are still based on 
today. This technique is discussed in more 
detail in Chapters 8 and 9. 

Today, even though there are many ways 
of representing video, they are still all related 
mathematically to RGB. These variations are 
discussed in more detail in Chapter 3. 

Several years ago, s-video was developed 
for connecting consumer equipment together 
(it is not used for broadcast purposes). It is a 
set of two analog signals, one analog Y and one 
that carries the analog U and V information in 
a specific format (also called C or chroma). 
Once available only on S-VHS machines, it is 
now present on many televisions, settop boxes, 
and DVD players. This is discussed in more 
detail in Chapter 9. 

Although always used by the professional 
video market, analog RGB video data has made 
a come-back for connecting consumer equip- 
ment together. Like s-video, it is not used for 
broadcast purposes. 

A variation of the analog YUV video signal, 
called YPbPy, is now also used for connecting 
consumer equipment together. Some manufac- 
turers incorrectly label the YPbPr connectors 
YUV, YCbCr, or Y(B-Y) (R-Y). 

Chapter 5 discusses the various analog 
interconnect schemes in detail. 


Digital Video 


Recently, digital video has become available to 
consumers, and is rapidly taking over most of 
the video applications. 

The most common digital signals used are 
RGB and YCbCr. RGB is simply the digitized 
version of the analog RGB video signals. 
YCbCr is basically the digitized version of the 
analog YUV and YPbPr video signals. YCbCr is 
the format used by DVD and digital television. 


Chapter 6 further discusses the various 
digital interconnect schemes. 


Best Connection Method 


There is always the question of “what is the 
best connection method for equipment?”. For 
consumer equipment, in order of decreasing 
video quality, here are the alternatives: 


1. Digital YCbCr 

2. Digital RGB 

3. Analog YPbPr 

4, Analog RGB 

5. Analog S-video 

6. Analog Composite 


Some will disagree about the order of ana- 
log YPbPr vs. analog RGB. However, most of 
the latest televisions, DVD players, personal 
video recorders (PVRs), and digital settop 
boxes do video processing in the YCbCr color 
space. Therefore, using analog YPbPr as the 
interconnect for equipment reduces the num- 
ber of color space conversions required. 

The same reasoning is used for placing 
digital YCbCr above digital RGB, when digital 
interconnect is available for consumer equip- 
ment. 

The computer industry has standardized 
on analog and digital RGB for connecting to 
the computer monitor. 


Video Timing 


Although it looks like video is continuous 
motion, it is actually a series of still images, 
changing fast enough that it looks like continu- 
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IMAGE 1 | TIME 


Figure 2.1. Video Is Composed of a Series of Still Images. Each Image Is Composed of Individual 


Lines of Data. 


ous motion, as shown in Figure 2.1. This typi- 
cally occurs 50 or 60 times per second for 
consumer video, and 70-90 times per second 
for computers. Therefore, timing information, 
called vertical sync, is needed to indicate when 
anew image is starting. 

Each still image is also composed of scan 
lines, lines of data that occur sequentially one 
after another down the display, as shown in 
Figure 2.1. Thus, timing information, called 
horizontal sync, is needed to indicate when a 
new scan line is starting. 

The vertical and horizontal sync informa- 
tion is usually transferred in one of three ways: 


1. Separate horizontal and vertical sync 
signals 
2. Separate composite sync signal 


3. Composite sync signal embedded within 
the video signal. 


The composite sync signal is a combina- 
tion of both vertical and horizontal sync. 

Computers and consumer equipment that 
use analog RGB video usually rely on tech- 
niques 1 and 2. Devices that use analog YPbPr 
video usually use technique 3. 

For digital video, either technique 1 is 
commonly used or timing code words are 
embedded within the digital video stream. This 
can be seen in Chapter 6. 


Interlaced vs. Progressive 


Since video is a series of still images, it makes 
sense to just display each full image consecu- 
tively, one after the another. 

This is the basic technique of progressive, 
or non-interlaced, displays. For displays that 
“paint” an image on the screen, such as a CRT, 
each image is displayed starting at the top left 
corner of the display, moving to the right edge 
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of the display. Then scanning then moves 
down one line, and repeats scanning left-to- 
right. This process is repeated until the entire 
screen is refreshed, as seen in Figure 2.2. 

In the early days of television, a technique 
called “interlacing” was used to reduce the 
amount of information sent for each image. By 
transferring the odd-numbered lines, followed 
by the even-numbered lines (as shown in Fig- 
ure 2.3), the amount of information sent for 
each image was halved. Interlacing is still used 
for most consumer applications, except for 
computer monitors and some new digital tele- 
vision formats. 

Given this advantage of interlaced, a com- 
mon question is why bother to use progres- 
sive? 

With interlace, each scan line is refreshed 
half as often as it would be if it were a progres- 
sive display. Therefore, to avoid line flicker on 
sharp edges due to a too-low refresh rate, the 
line-to-line changes are limited, essentially by 
vertically lowpass filtering the image. A pro- 
gressive display has no limit on the line-to-line 
changes, so is capable of providing a higher- 
resolution image (vertically) without flicker. 

However, a progressive display will show 
50 or 60 Hz flicker in large regions of constant 
color. Therefore, it is useful to increase the dis- 
play refresh, to 72 Hz for example. However, 
this increases the cost of the CRT circuitry 
and the video processing needed to generate 
additional images from the 50 or 60 Hz source. 

For the about same cost as a 50 or 60 Hz 
progressive display, the interlaced display can 
double its refresh rate (to 100 or 120 Hz) in an 
attempt to remove flicker. Thus, the battle 
rages on. 


Video Resolution 


Video resolution is one of those “fuzzy” things 
in life. It is common to see video resolutions of 
720 x 480 or 1920 x 1080. However, those are 
just the number of horizontal samples and ver- 
tical scan lines, and do not necessarily convey 
the amount of unique information. 

For example, an analog video signal can be 
sampled at 13.5 MHz to generate 720 samples 
per line. Sampling the same signal at 27 MHz 
would generate 1440 samples per line. How- 
ever, only the number of samples per line has 
changed, not the resolution of the content. 

Therefore, video is usually measured 
using “lines of resolution”. In essence, how 
many distinct black and white vertical lines can 
be seen across the display? This number is 
then normalized to a 1:1 display aspect ratio 
(dividing the number by 3/4 for a 4:3 display, 
or by 9/16 for a 16:9 display). Of course, this 
results in a lower value for widescreen (16:9) 
displays, which goes against intuition. 


Standard Definition 


Standard definition video usually has an active 
resolution of 720 x 480 or 720 x 576 interlaced. 
This translates into a maximum of about 540 
lines of resolution, or a 6.75 MHz bandwidth. 

Standard NTSC, PAL, and SECAM sys- 
tems fit into this category. For broadcast 
NTSC, with a maximum bandwidth of about 
4.2 MHz, this results in about 330 lines of reso- 
lution. 
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Figure 2.2. Progressive Displays “Paint” the Lines of An Image Consecutively, One After Another. 


Figure 2.3. Interlaced Displays “Paint” First One-Half of the Image (Odd Lines), Then the Other 


Half (Even Lines). 
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Enhanced Definition 


The latest new category, enhanced definition 
video, is usually touted as having an active res- 
olution of 720 x 480 progressive or greater. 

The basic difference between standard 
and enhanced definition is that standard defini- 
tion is interlaced, while enhanced definition is 
progressive. 


High Definition 


High definition video is usually defined as hav- 
ing an active resolution of 1920 x 1080 inter- 
laced or 1280 x 720 progressive. 


Video Compression 


The latest advances in consumer electronics, 
such as digital television (cable, satellite, and 
broadcast), DVD players and recorders, and 
PVRs, were made possible due to audio and 
video compression, based largely on MPEG 2. 

Core to video compression are motion esti- 
mation (during encoding), motion compensa- 
tion (during decoding), and the discrete cosine 
transform (DCT). Since there are entire books 
dedicated to these subjects, they are covered 
only briefly in this book. 


Application Block Diagrams 


Looking at a few simplified block diagrams 
helps envision how video flows through its var- 
ious operations. 


Video Capture Boards 


Figure 2.4 illustrates two common implementa- 
tions for video capture boards for the PC. 


In the first diagram, uncompressed video 
is sent to memory for processing and display 
via the PCI bus. More recent versions are able 
to also digitize the audio and send it to mem- 
ory via the PCI bus, rather than driving the 
sound card directly. 

In the second diagram, the video is input 
directly into the graphics controller chip, 
which sizes and positions the video for display. 
This implementation has the advantage of min- 
imizing PCI or AGP bus bandwidth. 

In either case, the NTSC/PAL decoder 
chip could be replaced with a DTV decoder 
solution to support digital television viewing. 


DVD Players 


Figure 2.5 is a simplified block diagram for a 
DVD player, showing the common audio and 
video processing blocks. 

DVD is based on MPEG 2 video compres- 
sion, and Dolby Digital or DTS audio compres- 
sion. The information is also scrambled (CSS) 
on the disc to copy protect it. 

The sharpness adjustment was originally 
used to compensate for the “tweaking” televi- 
sions do to the video signal before display. 
Unless the sharpness control of the television 
is turned down, DVD sources can look poor 
due to it being much better than typical broad- 
cast sources. To compensate, DVD players 
added a sharpness control to dull the image; 
the television “tweaks” the sharpness back up 
again. This avoided turning the sharpness up 
and down each time a different video source is 
selected (DVD vs. cable for example). With 
many televisions now able to have a sharpness 
adjustment for each individual input, having 
this control in the DVD player is redundant. 

There may also be user adjustments, such 
as brightness, contrast, saturation, and hue to 
enable adjusting the video quality to personal 
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Figure 2.4. Simplified Block Diagrams of a Video Capture Card for PCs. 


preferences. Again, with televisions now able 
to have these adjustments for each individual 
video input, they are largely redundant. 
In an attempt to “look different” on the 
showroom floor and quickly grab your atten- 
tion, some DVD players “tweak” the video fre- 
quency response. Since this “feature” is 
usually irritating over the long term, it should 
be defeated or properly adjusted. For the “film 
look” many viewers strive for, the frequency 
response should be as flat as possible. 


Another problem area is the output levels 
of the analog video signals. Although it is easy 
to generate very accurate video levels, they 
seem to vary considerably. Reviews are now 
pointing out this issue since switching between 
sources may mean changing brightness or 
black levels, defeating any television calibra- 
tion or personal adjustments that may have 
been done by the user. 
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Figure 2.5. Simplified Block Diagram of a DVD Player. 


Digital Television Settop Boxes 


The digital television standards fall into five 


major categories: 


1. ATSC (Advanced Television Systems 


Committee) 


2. DVB (Digital Video Broadcast) 
3. ARIB (Association of Radio Industries 


and Businesses) 


4, Digital cable standards, such as Open 


Cable 


These are based on MPEG 2 video com- 
pression, with Dolby Digital or MPEG audio 
compression. The transmission methods and 
capabilities beyond basic audio and video are 
the major differences between the standards. 

Figure 2.6 is a simplified block diagram for 
a digital television settop box, showing the 
common audio and video processing blocks. It 
is used to enable a standard television to dis- 
play digital television broadcasts, from either 
over-the-air, cable, or satellite. A digital televi- 
sion includes this circuitry inside the televi- 
sion. 


5. Proprietary standards, such as DirectTV 
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Figure 2.6. Simplified Block Diagram of a Digital Television Settop Box. 
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Color Spaces 


A color space is a mathematical representation 
of a set of colors. The three most popular color 
models are RGB (used in computer graphics); 
YIQ, YUV, or YCbCr (used in video systems); 
and CMYK (used in color printing). However, 
none of these color spaces are directly related 
to the intuitive notions of hue, saturation, and 
brightness. This resulted in the temporary pur- 
suit of other models, such as HSI and HSV, to 
simplify programming, processing, and end- 
user manipulation. 

All of the color spaces can be derived from 
the RGB information supplied by devices such 
as cameras and scanners. 


BLUE 


MAGENTA 


RGB Color Space 


The red, green, and blue (RGB) color space is 
widely used throughout computer graphics. 
Red, green, and blue are three primary addi- 
tive colors (individual components are added 
together to form a desired color) and are rep- 
resented by a three-dimensional, Cartesian 
coordinate system (Figure 3.1). The indicated 
diagonal of the cube, with equal amounts of 
each primary component, represents various 
gray levels. Table 3.1 contains the RGB values 
for 100% amplitude, 100% saturated color bars, 
a common video test signal. 
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YELLOW 


Figure 3.1. The RGB Color Cube. 
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R 0 to 255 255 255 0 0 255 255 0 
G 0 to 255 255 255 255 255 0 0 0 
B 0 to 255 255 0 255 0 255 0 255 


Table 3.1. 100% RGB Color Bars. 


The RGB color space is the most prevalent 
choice for computer graphics because color 
displays use red, green, and blue to create the 
desired color. Therefore, the choice of the 
RGB color space simplifies the architecture 
and design of the system. Also, a system that is 
designed using the RGB color space can take 
advantage of a large number of existing soft- 
ware routines, since this color space has been 
around for a number of years. 

However, RGB is not very efficient when 
dealing with “real-world” images. All three 
RGB components need to be of equal band- 
width to generate any color within the RGB 
color cube. The result of this is a frame buffer 
that has the same pixel depth and display reso- 
lution for each RGB component. Also, process- 
ing an image in the RGB color space is usually 
not the most efficient method. For example, to 
modify the intensity or color of a given pixel, 
the three RGB values must be read from the 
frame buffer, the intensity or color calculated, 
the desired modifications performed, and the 
new RGB values calculated and written back to 
the frame buffer. If the system had access to an 
image stored directly in the intensity and color 
format, some processing steps would be faster. 

For these and other reasons, many video 
standards use luma and two color difference 
signals. The most common are the YUV, YIQ, 


and YCbCr color spaces. Although all are 
related, there are some differences. 


YUV Color Space 


The YUV color space is used by the PAL 
(Phase Alternation Line), NTSC (National 
Television System Committee), and SECAM 
(Sequentiel Couleur Avec Mémoire or Sequen- 
tial Color with Memory) composite color video 
standards. The black-and-white system used 
only luma (Y) information; color information 
(U and V) was added in such a way that a 
black-and-white receiver would still display a 
normal black-and-white picture. Color receiv- 
ers decoded the additional color information to 
display a color picture. 

The basic equations to convert between 
gamma-corrected RGB (notated as R’G’B’ and 
discussed later in this chapter) and YUV are: 


Y = 0.299R’ + 0.587G’ + 0.114B’ 


U=-0.147R’ — 0.289G’ + 0.436B’ 
= 0.492 (B’ - Y) 


V = 0.615R’ — 0.515G’ — 0.100B’ 
= 0.877(R’ - Y) 
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R’=Y+1.140V 
G’ = Y -0.395U — 0.581V 
B’=Y +2.032U 


For digital R’'G’B’ values with a range of 0- 
255, Y has a range of 0-255, U a range of 0 to 
+112, and V a range of 0 to +157. These equa- 
tions are usually scaled to simplify the imple- 
mentation in an actual NTSC or PAL digital 
encoder or decoder. 

Note that for digital data, 8-bit YUV and 
R’G’B’ data should be saturated at the 0 and 
255 levels to avoid underflow and overflow 
wrap-around problems. 

If the full range of (B’— Y) and (R’ - Y) had 
been used, the composite NTSC and PAL lev- 
els would have exceeded what the (then cur- 
rent) black-and-white television transmitters 
and receivers were capable of supporting. 
Experimentation determined that modulated 
subcarrier excursions of 20% of the luma (Y) 
signal excursion could be permitted above 
white and below black. The scaling factors 
were then selected so that the maximum level 
of 75% amplitude, 100% saturation yellow and 
cyan color bars would be at the white level 
(100 IRE). 


YIQ Color Space 


The YIQ color space, further discussed in 
Chapter 8, is derived from the YUV color space 
and is optionally used by the NTSC composite 
color video standard. (The “I” stands for “in- 
phase” and the “Q” for “quadrature,” which is 
the modulation method used to transmit the 
color information.) The basic equations to con- 
vert between R’G’B’ and YIQ are: 


Y = 0.299R’ + 0.587G’ + 0.114B’ 


I = 0.596R’ — 0.275G’ — 0.321B’ 
= Vcos 33° — Usin 33° 
= 0.736(R’ — Y) — 0.268(B’ — Y) 


Q= 0.212R’ — 0.523G’ + 0.311B’ 
= Vsin 33° + Ucos 33° 
= 0.478(R’ — Y) + 0.413(B’ — Y) 


or, using matrix notation: 


I} _ JO 1}| cos(33) sin(33))|U 
Q 1 0] |—sin(33) cos(33)}|V 
R’ = Y + 0.956I + 0.621Q 


G' = Y- 0.2721 - 0.647Q 
B’ =Y-1.1071 + 1.704Q 


For digital R’'G’B’ values with a range of 0- 
255, Y has a range of 0-255, I has a range of 0 
to +152, and Q has a range of 0 to +134. Iand Q 
are obtained by rotating the U and V axes 33°. 
These equations are usually scaled to simplify 
the implementation in an actual NTSC digital 
encoder or decoder. 

Note that for digital data, 8-bit YIQ and 
R’G’B’ data should be saturated at the 0 and 
255 levels to avoid underflow and overflow 
wrap-around problems. 


YCbCr Color Space 


The YCbCr color space was developed as part 
of ITU-R BT.601 during the development of a 
world-wide digital component video standard 
(discussed in Chapter 4). YCbCr is a scaled 
and offset version of the YUV color space. Y is 
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defined to have a nominal 8-bit range of 16- 
235; Cb and Cr are defined to have a nominal 
range of 16-240. There are several YCbCr sam- 
pling formats, such as 4:4:4, 4:2:2, 4:1:1, and 
4:2:0 that are also described. 


RGB - YCbCr Equations: SDTV 
The basic equations to convert between 8bit 
digital R’G’B’ data with a 16-235 nominal 
range and YCbCr are: 

Ygo1 = 0.299R’ + 0.587G’ + 0.114B’ 

Cb = -0.172R’ — 0.339G’ + 0.511B’ + 128 

Cr = 0.511R’ - 0.428G’ — 0.083B’ + 128 


R’ = Ygo1 + 1.371(Cr— 128) 


G'= Y601 — 0.698 (Cr a 128) 
— 0.336 (Cb - 128) 


B’= Yeo. + 1.732 (Cb = 128) 


When performing YCbCr to R’G’B’ con- 
version, the resulting R’G’B’ values have a 
nominal range of 16-235, with possible occa- 
sional excursions into the 0-15 and 236-255 
values. This is due to Y and CbCr occasionally 
going outside the 16-235 and 16-240 ranges, 
respectively, due to video processing and 
noise. Note that 8-bit YCbCr and R’G’B’ data 
should be saturated at the 0 and 255 levels to 
avoid underflow and overflow wrap-around 
problems. 

Table 3.2 lists the YCbCr values for 75% 
amplitude, 100% saturated color bars, a com- 
mon video test signal. 


Computer Systems Considerations 

If the R’G’B’ data has a range of 0-255, as is 
commonly found in computer systems, the fol- 
lowing equations may be more convenient to 
use: 


Ygo1 = 0.257R’ + 0.504G’ + 0.098B’ + 16 
Cb = -0.148R’ - 0.291G’ + 0.439B’ + 128 
Cr = 0.439R’ - 0.368G’ - 0.071B’ + 128 


—_ iy] 
Se = 2 ° oO Ss a a) 
SDTV 
Y 16 to 235 180 162 131 112 84 65 35 16 
Cb 16 to 240 128 44 156 72 184 100 212 128 
Cr 16 to 240 128 142 44 58 198 212 114 128 
HDTV 
Y 16 to 235 180 168 145 133 63 51 28 16 
Cb 16 to 240 128 44 147 63 193 109 212 128 
Cr 16 to 240 128 136 44 52 204 212 120 128 


Table 3.2. 75% YCbCr Color Bars. 
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R’ = 1.164 (Yoo — 16) + 1.596(Cr — 128) 


G’ = 1.164 (Ygq1 - 16) - 0.813(Cr — 128) - 
0.391 (Cb - 128) 


B’ = 1.164 (Ygo1 — 16) + 2.018(Cb — 128) 


Note that 8-bit YCbCr and R’G’B’ data 
should be saturated at the 0 and 255 levels to 
avoid underflow and overflow wrap-around 
problems. 


RGB - YCbCr Equations: HDTV 


The basic equations to convert between 8&-bit 
digital R’G’B’ data with a 16-235 nominal 
range and YCbCr are: 


Y799 = 0.213R’ + 0.715G’ + 0.072B’ 
Cb = -0.117R’ - 0.394G’ + 0.511B’ + 128 
Cr = 0.511R’ — 0.464G’ - 0.047B’ + 128 


R’ = Y7o9 + 1.540(Cr - 128) 


G'= Y709 — 0.459 (Cr - 128) 
- 0.183 (Cb - 128) 


B’= Y709 oP 1.816(Cb = 128) 


When performing YCbCr to R’G’B’ con- 
version, the resulting R’G’B’ values have a 
nominal range of 16-235, with possible occa- 
sional excursions into the 0-15 and 236-255 
values. This is due to Y and CbCr occasionally 
going outside the 16-235 and 16-240 ranges, 
respectively, due to video processing and 
noise. Note that 8-bit YCbCr and R’G’B’ data 
should be saturated at the 0 and 255 levels to 
avoid underflow and overflow wrap-around 
problems. 

Table 3.2 lists the YCbCr values for 75% 
amplitude, 100% saturated color bars, a com- 
mon video test signal. 


Computer Systems Considerations 

If the R’G’B’ data has a range of 0-255, as is 
commonly found in computer systems, the fol- 
lowing equations may be more convenient to 
use: 


Y799 = 0.183R’ + 0.614G’ + 0.062B’ + 16 
Cb = -0.101R’ - 0.338G’ + 0.439B’ + 128 
Cr = 0.439R’ - 0.399G’ - 0.040B’ + 128 


R’ = 1.164 (Y799 — 16) + 1.793(Cr - 128) 


G’ = 1.164 (Y799 - 16) - 0.534(Cr - 128) - 
0.213 (Cb - 128) 


B’ = 1.164 (Y799— 16) + 2.115(Cb — 128) 


Note that 8bit YCbCr and R’G’B’ data 
should be saturated at the 0 and 255 levels to 
avoid underflow and overflow wrap-around 
problems. 


4:4:4 YCbCr Format 


Figure 3.2 illustrates the positioning of YCbCr 
samples for the 4:4:4 format. Each sample has 
a Y, a Cb, and a Cr value. Each sample is typi- 
cally 8 bits (consumer applications) or 10 bits 
(pro-video applications) per component. Each 
sample therefore requires 24 bits (or 30 bits 
for pro-video applications). 


4:2:2 YCbCr Format 


Figure 3.3 illustrates the positioning of YCbCr 
samples for the 4:2:2 format. For every two 
horizontal Y samples, there is one Cb and Cr 
sample. Each sample is typically 8 bits (con- 
sumer applications) or 10 bits (pro-video appli- 
cations) per component. Each sample 
therefore requires 16 bits (or 20 bits for pro- 
video applications), usually formatted as 
shown in Figure 3.4. 
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To display 4:2:2 YCbCr data, it is first con- 
verted to 4:4:4 YCbCr data, using interpolation 
to generate the missing Cb and Cr samples. 


4:1:1 YCbCr Format 


Figure 3.5 illustrates the positioning of YCbCr 
samples for the 4:1:1 format (also known as 
YUV12), used in some consumer video and DV 
video compression applications. For every four 
horizontal Y samples, there is one Cb and Cr 
value. Each component is typically 8 bits. Each 
sample therefore requires 12 bits, usually for- 
matted as shown in Figure 3.6. 

To display 4:1:1 YCbCr data, it is first con- 
verted to 4:4:4 YCbCr data, using interpolation 
to generate the missing Cb and Cr samples. 


ACTIVE X = FIELD 1 
LINE [X] = FIELD 2 
NUMBER 


©® © 
©® © 
© © 
©® © 
©® © 


© © 


© © © 
© ®® ® © 


) 
© © © 
© © © 


© © © 


© B,CR SAMPLE 


@ = Y SAMPLE 


Figure 3.2. 4:4:4 Co-Sited Sampling. The 
sampling positions on the active scan lines 
of an interlaced picture. 


4:2:0 YCbCr Format 


Rather than the horizontal-only 2:1 reduction 
of Cb and Cr used by 4:2:2, 4:2:0 YCbCr imple- 
ments a 2:1 reduction of Cb and Cr in both the 
vertical and horizontal directions. It is com- 
monly used for video compression. 

As shown in Figures 3.7 through 3.11, 
there are several 4:2:0 sampling formats. Table 
3.3 lists the YCbCr formats for various DV 
applications. 

To display 4:2:0 YCbCr data, it is first con- 
verted to 4:4:4 YCbCr data, using interpolation 
to generate the new Cb and Cr samples. 


ACTIVE X = FIELD 1 


LINE [X] = FIELD 2 

NUMBER 
1 —@©—e—_@_e—_ © 
[1] —@—e—@—e—_@©—_e 
2 —©—e—_@_e_@e 
[2] —@—e—@—e—@—e 
3 —@—e—_@_e_@—e 


© B,CR SAMPLE 


@ Y SAMPLE 


Figure 3.3. 4:2:2 Co-Sited Sampling. The 
sampling positions on the active scan lines 
of an interlaced picture. 
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SAMPLE | SAMPLE | SAMPLE | SAMPLE | SAMPLE | SAMPLE 
0 1 2 3 4 5 
y7-0 | Y7-1 | y7-2 | Y7-3 | Y7-4 | Y7-5 
Ye-o0 | Ye-1 | Ye-2 | Ye-3 | Ye-4 | Ye-5 
y5-0 | Y5-1 | Y5-2 | Y5-3 | Y5-4 | Y5-5 
y4-0 | Y4-1 | ya-2 | Ya-3 | Ya-4 | Y4-5 
y3-0 | Y3-1 | y3-2 | Y3-3 | Y3-4 | Y3-5 
y2-0 | Y2-1 | y2-2 | Y2-3 | Y2-4 | Y2-5 
yi-o0 | yi-1 | yt-2 | y1-3 | vt-4 | Y1-5 
Yo-o0 | Yo-1 | yo-2 | Yo-3 | Yo-4 | Yo-5 |16 BITS 
PER 
CB7-0 | CR7-0 | CB7-2 | CR7-2 | CB7-4 | CR7-4 |SAMPLE 
cBe-0 | crR6e-0 | cBs-2 | crRe-2 | cBe-4 | CRe-4 
cBs-0 | cr5-0 | cB5-2 | cr5-2 | cB5-4 | CR5-4 
cB4-0 | cr4-0 | cB4-2 | cr4-2 | cB4-4 | cR4-4 
cB3-0 | cR3-0 | cB3-2 | cR3-2 | cB3-4 | CR3-4 
cB2-0 | cR2-0 | cB2-2 | cr2-2 | cB2-4 | CR2-4 
cB1-0 | cR1-0 | cBi-2 | cr1-2 | cB1-4 | cR1-4 
cBo-0 | cRo-0 | cBo-2 | crRo-2 | cBo-4 | CRo-4 
-0 = SAMPLE 0 DATA 
-1 = SAMPLE 1 DATA 
-2 = SAMPLE 2 DATA 
-3 = SAMPLE 3 DATA 
-4 = SAMPLE 4 DATA 
Figure 3.4. 4:2:2 Frame Buffer Formatting. 
SAMPLE | SAMPLE | SAMPLE | SAMPLE | SAMPLE | SAMPLE 
0 1 2 3 4 5 
ACTIVE Aon FIELD: y7-0 | Y7-1 | Y7-2 | Y7-3 | Y7-4 | Y7-5 
EINE: [X= FIELD “2 Ye-o0 | Ye-1 | Ye-2 | Ye-3 | Y6e-4 | Ye-5 
NUMBER Ys-0 | Y5-1 | Y5-2 | Y5-3 | Y5-4 | Y5-5 
iee=——~ @—e y4-0 | Y4-1 | Y4-2 | Y4-3 | Y4-4 | y4-5 
y3-0 | Y3-1 | Y3-2 | Y3-3 | Y3-4 | y3-5 
y2-0 | Y2-1 | y2-2 | y2-3 | Y2-4 | y2-5 | 142 BITS 
[1] —@—e—e @—e yt-o | yt-1 | yi-2 | yi-3 | v1-4 | v1-5 | PER 
Yo-o0 | Yo-1 | Yo-2 | Yo-3 | Yo-4 | yo-5 |SAMPLE 
2 —@—e—e © 
cB7-0 | cB5-0 | cB3-0 | cB1-0 | CB7-4 | cBS-4 
[2] —@—e—e e cBe-0 | cB4-0 | cB2-0 | cBo-o | CB6-4 | cB4-4 
cr7-0 | cR5-0 | CR3-0 | CR1-0 | CR7-4 | CR5-4 
26 =s—~ 7 cr6-o | cR4-0 | CR2-0 | CRo-0 | CR6-4 | CR4-4 
-0 = SAMPLE 0 DATA 
© 8, CR SAMPLE -1 = SAMPLE 1 DATA 
-2 = SAMPLE 2 DATA 
@ =Y SAMPLE -3 = SAMPLE 3 DATA 
-4 = SAMPLE 4 DATA 


Figure 3.5. 4:1:1 Co-Sited Sampling. The 
sampling positions on the active scan lines 


of an interlaced picture. 


Figure 3.6. 4:1:1 Frame Buffer Formatting. 
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ACTIVE ACTIVE 
LINE LINE 
NUMBER NUMBER 
1 oe oe e @ @ @ 1 @ @ oe @ @ oe 
O O O O O O 
2 @ ® @ @ @ oe 2 @ @ @ @ @ @ 
3 @ @ oe @ @ @ 3 @ @ @ @ @ @ 
O O O O O O 
4 @ @ @ @ @ oe 4 e @ @ @ @ e 
5 oe @ @ @ e oe 5 e @ e @ oe @ 
O CALCULATED CB, CR SAMPLE O CALCULATED CB, CR SAMPLE 
@ Y SAMPLE @ = Y SAMPLE 
Figure 3.7. 4:2:0 Sampling for H.261, H.263, Figure 3.8. 4:2:0 Sampling for MPEG 2. The 
and MPEG 1. The sampling positions on the sampling positions on the active scan lines of 
active scan lines of a progressive or a progressive or noninterlaced picture. 
noninterlaced picture. 
o o o o ° ° -—£& n > 
S. | a> | 52/42] £ | Eo | 28] 3 | Ba | #8 
oa oa of oO ° ow wf 5 ol aq 
¢ ip $a | Ha a a a8 a == 
4:2:2 Co-Sited x x x 
4:1:1 Co-Sited x x x 
4:2:0 x x 
4:2:0 Co-Sited x x 


Table 3.3. YCbCr Formats for Various DV Applications. 
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ACTIVE FIELD N 
LINE 
NUMBER 
1 e e e e 
O O O 
1 
2 e @ e e 
2 
3 e e oe od 
O © O 
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4 —e—e——_e—_e—_e__e—-- 
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O CALCULATED CB, CR SAMPLE 


@ Y SAMPLE 
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Figure 3.9. 4:2:0 Sampling for MPEG 2. The sampling positions on the active scan lines of an 
interlaced picture (top_field_first = 1). 
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O CALCULATED CB, CR SAMPLE 
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Figure 3.10. 4:2:0 Sampling for MPEG 2. The sampling positions on the active scan lines of an 
interlaced picture (top_field_first = 0). 
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Figure 3.11. 4:2:0 Co-Sited Sampling for 576-Line DV and DVCAM. The sampling positions on the 


active scan lines of an interlaced picture. 


PhotoYCC Color Space 


PhotoYCC (a trademark of Eastman Kodak 
Company) was developed to encode Photo CD 
image data. The goal was to develop a display- 
device-independent color space. For maximum 
video display efficiency, the color space is 
based upon ITU-R BT.601 and BT.709. 

The encoding process (RGB to PhotoYCC) 
assumes CIE Standard Iluminant Dg; and that 
the spectral sensitivities of the image capture 
system are proportional to the color-matching 
functions of the BT:709 reference primaries. 
The RGB values, unlike those for a computer 
graphics system, may be negative. PhotoYCC 
includes colors outside the BT:709 color 
gamut; these are encoded using negative val- 
ues. 


RGB to PhotoYCC 
Linear RGB data (normalized to have values of 
0 to 1) is nonlinearly transformed to PhotoYCC 
as follows: 
for R, G, B= 0.018 

R’ = 1.099 R®-4 — 0.099 

G’ = 1.099 G° — 0.099 

B’ = 1.099 B®-* — 0.099 


for -0.018 < R, G, B < 0.018 


R’=4.5R 
G'=4.5G 
B’=45B 
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for R, G, B< -0.018 
R’ = - 1.099 |R|°-* - 0.099 
G’ =- 1.099 |G|°-° - 0.099 
B’ = — 1.099 |B|°-4° - 0.099 
From R’G’B’ with a 0-255 range, a luma 


and two chrominance signals (C1 and C2) are 
generated: 


Y = 0.213R’ + 0.419G’ + 0.081B’ 
C1 =—- 0.131R’ - 0.256G’ + 0.387B’ + 156 
C2 = 0.373R’ — 0.312G’ — 0.061B’ + 137 


As an example, a 20% gray value (R, G, and 
B = 0.2) would be recorded on the PhotoCD 
disc using the following values: 


Y=79 
C1 = 156 
C2 = 137 


PhotoYCC to RGB 


Since PhotoYCC attempts to preserve the 
dynamic range of film, decoding PhotoYCC 
images requires the selection of a color space 
and range appropriate for the output device. 
Thus, the decoding equations are not always 
the exact inverse of the encoding equations. 
The following equations are suitable for gener- 
ating RGB values for driving a CRT display, 
and assume a unity relationship between the 
luma in the encoded image and the displayed 
image. 


R’ = 0.981Y + 1.315(C2 - 137) 


G’ = 0.981Y - 0.311(C1- 156) 
— 0.669 (C2 - 137) 


B’ =0.981Y + 1.601 (C1 - 156) 


The R’G’B’ values should be saturated to a 
range of 0 to 255. The equations above assume 
the display uses phosphor chromaticities that 
are the same as the BT:709 reference prima- 
ries, and that the video signal luma (V) and the 
display luminance (L) have the relationship: 


for V > 0.0812 
L= ((V + 0.099) / 1.099) 1/9-45 


for V < 0.0812 
L=V/45 


HSI, HLS, and HSV Color 
Spaces 


The HSI (hue, saturation, intensity) and HSV 
(hue, saturation, value) color spaces were 
developed to be more “intuitive” in manipulat- 
ing color and were designed to approximate 
the way humans perceive and interpret color. 
They were developed when colors had to be 
specified manually, and are rarely used now 
that users can select colors visually or specify 
Pantone colors. These color spaces are dis- 
cussed for “historic” interest. HLS (hue, light- 
ness, saturation) is similar to HSI; the term 
lightness is used rather than intensity. 

The difference between HSI and HSV is 
the computation of the brightness component 
(I or V), which determines the distribution and 
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dynamic range of both the brightness (I or V) 
and saturation (S). The HSI color space is best 
for traditional image processing functions such 
as convolution, equalization, histograms, and 
so on, which operate by manipulation of the 
brightness values since I is equally dependent 
on R, G, and B. The HSV color space is pre- 
ferred for manipulation of hue and saturation 
(to shift colors or adjust the amount of color) 
since it yields a greater dynamic range of satu- 
ration. 

Figure 3.12 illustrates the single hexcone 
HSV color model. The top of the hexcone cor- 
responds to V = 1, or the maximum intensity 
colors. The point at the base of the hexcone is 
black and here V = 0. Complementary colors 
are 180° opposite one another as measured by 
H, the angle around the vertical axis (V), with 
red at 0°. The value of S is a ratio, ranging from 
0 on the center line vertical axis (V) to 1 on the 
sides of the hexcone. Any value of S between 0 
and 1 may be associated with the point V = 0. 
The point S = 0, V = 1 is white. Intermediate 
values of V for S = 0 are the grays. Note that 
when S = 0, the value of H is irrelevant. From 
an artist’s viewpoint, any color with V=1,S=1 
is a pure pigment (whose color is defined by 
H). Adding white corresponds to decreasing S 
(without changing V); adding black corre- 
sponds to decreasing V (without changing S). 
Tones are created by decreasing both S and V. 
Table 3.4 lists the 75% amplitude, 100% satu- 
rated HSV color bars. 

Figure 3.13 illustrates the double hexcone 
HSI color model. The top of the hexcone corre- 
sponds to I = 1, or white. The point at the base 
of the hexcone is black and here I = 0. Comple- 
mentary colors are 180° opposite one another 
as measured by H, the angle around the verti- 
cal axis (I), with red at 0° (for consistency with 
the HSV model, we have changed from the 
Tektronix convention of blue at 0°). The value 
of S ranges from 0 on the vertical axis (I) to 1 


on the surfaces of the hexcone. The grays all 
have S = 0, but maximum saturation of hues is 
at S = 1, 1 = 0.5. Table 3.5 lists the 75% ampli- 
tude, 100% saturated HSI color bars. 


Chromaticity Diagram 


The color gamut perceived by a person with 
normal vision (the 1931 CIE Standard 
Observer) is shown in Figure 3.14. The dia- 
gram and underlying mathematics were 
updated in 1960 and 1976; however, the NTSC 
television system is based on the 1931 specifi- 
cations. 

Color perception was measured by viewing 
combinations of the three standard CIE (Inter- 
national Commission on Illumination or Com- 
mission Internationale de lEclairage) primary 
colors: red with a 700-nm wavelength, green at 
546.1 nm, and blue at 435.8 nm. These primary 
colors, and the other spectrally pure colors 
resulting from mixing of the primary colors, 
are located along the curved outer boundary 
line (called the spectrum locus), shown in Fig- 
ure 3.14. 

The ends of the spectrum locus (at red and 
blue) are connected by a straight line that rep- 
resents the purples, which are combinations of 
red and blue. The area within this closed 
boundary contains all the colors that can be 
generated by mixing light of different colors. 
The closer a color is to the boundary, the more 
saturated it is. Colors within the boundary are 
perceived as becoming more pastel as the cen- 
ter of the diagram (white) is approached. Each 
point on the diagram, representing a unique 
color, may be identified by its x and y coordi- 
nates. 

In the CIE system, the intensities of red, 
green, and blue are transformed into what are 
called the tristimulus values, which are repre- 
sented by the capital letters X, Y, and Z. These 
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YELLOW 
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CYAN RED 


180° 


MAGENTA 
300° 


Figure 3.12. Single Hexcone HSV Color Model. 


Se = 2 °o 6 s a ra) 

0° to 360° - 60° 180° 120° 300° 0° 240° - 

S Otol 0 1 1 1 ak 1 1 0 
Otol 0.75 0.75 0.75 0.75 0.75 0.75 0.75 


Table 3.4. 75% HSV Color Bars. 
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WHITE 


YELLOW 
60° 


CYAN RED 


180° 


MAGENTA 
300° 


Figure 3.13. Double Hexcone HSI Color Model. For consistency with the 
HSV model, we have changed from the Tektronix convention of blue at 0 ° 
and depict the model as a double hexcone rather than as a double cone. 


—_ iy] 

ge Sol! dee Wee dls eat Se Es ore 
H 0° to 360° - 60° 180° 120° 300° 0° 240° - 
S Otol 0 1 1 1 dl. 1 1 0 

Otol 0.75 0.375 0.375 0.375 0.375 0.375 0.375 


Table 3.5. 75% HSI Color Bars. For consistency with the HSV model, we have changed 
from the Tektronix convention of blue at 0°. 
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values represent the relative quantities of the 
primary colors. 

The coordinate axes of Figure 3.14 are 
derived from the tristimulus values: 


x =X/(X + Y+Z) 
= red/(red + green + blue) 


y=Y/(X+Y+Z) 
= green/(red + green + blue) 


Z=Z/(K+Y+Z) 
= blue/(red + green + blue) 


The coordinates x, y, and z are called chro- 
maticity coordinates, and they always add up to 
1. As a result, z can always be expressed in 
terms of x and y, which means that only x and y 
are required to specify any color, and the dia- 
gram can be two-dimensional. 

Typically, a source or display specifies 
three (x, y) coordinates to define the three pri- 
mary colors it uses. The triangle formed by the 
three (x, y) coordinates encloses the gamut of 
colors that the source or display can repro- 
duce. This is shown in Figure 3.15, which com- 
pares the color gamuts of NTSC, PAL, and 
typical inks and dyes. 

Note that no set of three colors can gener- 
ate all possible colors, which is why television 
pictures are never completely accurate. For 
example, a television cannot reproduce mono- 
chromatic yellow-green (540 nm) since this 
color lies outside the triangle formed by red, 
green, and blue. 

In addition, a source or display usually 
specifies the (x, y) coordinate of the white color 
used, since pure white is not usually captured 
or reproduced. White is defined as the color 
captured or produced when all three primary 
signals are equal, and it has a subtle shade of 
color to it. Note that luminance, or brightness 
information, is not included in the standard 


CIE 1931 chromaticity diagram, but is an axis 
that is orthogonal to the (%, y) plane. The 
lighter a color is, the more restricted the chro- 
maticity range is. 

The chromaticities and reference white 
(CIE illuminate C) for the 1953 NTSC standard 
are: 


R: x,=0.67 y, = 0.33 
G. xg=0.21 Yg= 9.71 
B: xXp,= 0.14 Vp = 0.08 


white: Xw=0.3101 yy, = 0.3162 


Modern NTSC systems use a different set 
of RGB phosphors, resulting in slightly differ- 
ent chromaticities of the RGB primaries and 
reference white (CIE illuminate D¢;): 


R: x,=0.630  y,= 0.340 
G: xg=0.310  y,= 0.595 
B: x,=0.155 y= 0.070 


white: Xw=0.3127 yy, = 0.3290 


As illustrated in Figure 3.15, the color 
accuracy of television receivers has declined in 
order to increase the brightness. 

The chromaticities and reference white 
(CIE illuminate Dgs) for PAL and SECAM are: 


R: x,=0.64 ~— y,=0.33 
G: xg=0.29 y= 0.60 
B: xp=0.15  yp,= 0.06 


white: Xw=0.3127 yy, = 0.3290 


The chromaticities and reference white 
(CIE illuminate Dgs) for HDTV are based on 
BT.709: 
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R: x,=064 ~— y,=0.33 
G: xg=030 y= 0.60 
B: x,=0.15 y,= 0.06 


white: Xw= 0.3127 yw = 0.3290 
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Figure 3.14. CIE 1931 Chromaticity Diagram 


Showing Various Color Regions. 
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Non-RGB Color Space 


Considerations 


When processing information in a non-RGB 
color space (such as YIQ, YUV, or YCbCr), 
care must be taken that combinations of values 
are not created that result in the generation of 
invalid RGB colors. The term invalid refers to 
RGB components outside the normalized RGB 


limits of (1, 1, 1). 
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Figure 3.15. CIE 1931 Chromaticity Diagram 


Showing Various Color Gamuts. 
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ALL POSSIBLE 
Y = 255, 


YCBCR VALUES 
Ze CB =CR = 128 


255 


YCBCR VALID 
COLOR BLOCK 


= 255 


R = RED 

G = GREEN 

B = BLUE 

Y = YELLOW 
C = CYAN 

M = MAGENTA 
W = WHITE 
BK = BLACK 


Figure 3.16. RGB Limits Transformed into 3-D YCbCr Space. 


For example, given that RGB has a normal- 
ized value of (1, 1, 1), the resulting YCbCr 
value is (235, 128, 128). If Cb and Cr are manip- 
ulated to generate a YCbCr value of (235, 64, 
73), the corresponding RGB normalized value 
becomes (0.6, 1.29, 0.56)—note that the green 
value exceeds the normalized value of 1. 

From this illustration it is obvious that 
there are many combinations of Y, Cb, and Cr 
that result in invalid RGB values; these YCbCr 
values must be processed so as to generate 
valid RGB values. Figure 3.16 shows the RGB 
normalized limits transformed into the YCbCr 
color space. 

Best results are obtained using a constant 
luma and constant hue approach—Y is not 


altered while Cb and Cr are limited to the max- 
imum valid values having the same hue as the 
invalid color prior to limiting. The constant hue 
principle corresponds to moving invalid CbCr 
combinations directly towards the CbCr origin 
(128, 128), until they lie on the surface of the 
valid YCbCr color block. 

When converting to the RGB color space 
from a non-RGB color space, care must be 
taken to include saturation logic to ensure 
overflow and underflow wrap-around condi- 
tions do not occur due to the finite precision of 
digital circuitry. 8-bit RGB values less than 0 
must be set to 0, and values greater than 255 
must be set to 255. 


32 ~=Chapter 3: Color Spaces 


Gamma Correction 


The transfer function of most displays pro- 
duces an intensity that is proportional to some 
power (referred to as gamma) of the signal 
amplitude. As a result, high-intensity ranges 
are expanded and low-intensity ranges are 
compressed (see Figure 3.17). This is an 
advantage in combatting noise, as the eye is 
approximately equally sensitive to equally rela- 
tive intensity changes. By “gamma correcting” 
the video signals before display, the intensity 
output of the display is roughly linear (the gray 
line in Figure 3.17), and transmission-induced 
noise is reduced. 

To minimize noise in the darker areas of 
the image, modern video systems limit the 
gain of the curve in the black region. This 
technique limits the gain close to black and 
stretches the remainder of the curve to main- 
tain function and tangent continuity. 

Although video standards assume a dis- 
play gamma of about 2.2, a gamma of about 2.4 
is more realistic for CRT displays. However, 
this difference improves the viewing in dimly 
lit environments, such as the home. More 
accurate viewing in brightly lit environments 
may be accomplished by applying another 
gamma factor of about 1.09 (2.4/2.2). 


Early NTSC Systems 


Early NTSC systems assumed a simple trans- 
form at the display, with a gamma of 2.2. RGB 
values are normalized to have a range of 0 to 1: 


R= R22 
Gz G22 
B= B22 


0.8 4 TRANSMITTED 
PRE-CORRECTION 


06 4 


0.4 4 


DISPLAY 
CHARACTERISTIC. 


0.2 4 


0.0 


+ 
0.0 0.2 0.4 0.6 0.8 1.0 


Figure 3.17. Effect of Gamma. 


To compensate for the nonlinear display, 
linear RGB data was “gamma-corrected” prior 
to transmission by the inverse transform. RGB 
values are normalized to have a range of 0 to 1: 


Rs R1/2.2 
@z Gi/22 
B’ - B1/22 
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Early PAL and SECAM Systems 


Most early PAL and SECAM systems assumed 
a simple transform at the display, with a 
gamma of 2.8. RGB values are normalized to 
have a range of 0 to 1: 


R= R28 
Gs G28 
B= B28 


To compensate for the nonlinear display, 
linear RGB data was “gamma-corrected” prior 
to transmission by the inverse transform. RGB 
values are normalized to have a range of 0 to 1: 


R = R1/28 

G’-Gl/28 

B’= Bi/28 
Current Systems 


Current NTSC and HDTV systems assume the 
following transform at the display, with a 
gamma of [1/0.45]. RGB values are normal- 
ized to have a range of 0 to 1: 


for (R’, G’, B’)) < 0.0812 
R=R'/45 
G=G'/45 
B=B'/45 


for (R’, G’, B’) 2 0.0812 
R = ((R’ + 0.099) / 1.099) 1/0-45 
G = ((G’ + 0.099) / 1.099) 1/049 
B = ((B’ + 0.099) / 1.099) 1/045 


To compensate for the nonlinear display, 
linear RGB data is “gamma-corrected” prior to 
transmission by the inverse transform. RGB 
values are normalized to have a range of 0 to 1: 


for R, G, B< 0.018 


R’'=45R 
G'=4.5G 
B’=4.5B 


for R, G, B= 0.018 
R’ = 1.099 R®-* — 0.099 
G’ = 1.099 G°** — 0.099 
B’ = 1.099 B°-*° — 0.099 


Although most PAL and SECAM standards 
specify a gamma of 2.8, a value of [1/0.45] is 
now commonly used. Thus, these equations 
are now also used for PAL and SECAM sys- 
tems. 


Non-CRT Displays 


Many non-CRT displays have a different 
gamma than that used for video. To simplify 
interfacing, the display drive electronics are 
usually designed to present a standard CRT 
transform, with a gamma of [1/0.45]. The dis- 
play drive electronics then compensate for the 
actual gamma of the display device. 


34 = Chapter 3: Color Spaces 


References 


1. 


2. 


Benson, K. Blair, Television Engineering 
Handbook. McGraw-Hill, Inc., 1986. 

Clarke, C.K.P, 1986, Colour Encoding and 
Decoding Techniques for Line-Locked Sam- 
pled PAL and NTSC Television Signals, 
BBC Research Department Report BBC 
RD1986/2. 


. Devereux, V. G., 1987, Limiting of YUV dig- 


ital video signals, BBC Research Depart- 
ment Report BBC RD1987 22. 


. EIA Standard EJA-189-A, July 1976, 


Encoded Color Bar Signal. 


. Faroudja, Yves Charles, NTSC and Beyond. 


IEEE Transactions on Consumer Electron- 
ics, Vol. 34, No. 1, February 1988. 


. ITU-R BT.470-6, 1998, Conventional Televi- 


sion Systems. 


. ITU-R BT-601-5, 1995, Studio Encoding 


Parameters of Digital Television for 
Standard 4:3 and Widescreen 16:9 Aspect 
Ratios. 


. ITU-R BT:709-4, 2000, Parameter Values for 


the HDTV Standards for Production and 
International Programme Exchange. 


. Photo CD Information Bulletin, Fully Uti- 


lizing Photo CD Images—PhotoYCC Color 
Encoding and Compression Schemes, May 
1994, Eastman Kodak Company. 


| chapter 4 | 4 


Video Signals 
Overview 


Video signals come in a wide variety of 
options—number of scan lines, interlaced vs. 
progressive, analog vs. digital, etc. This chap- 
ter provides an overview of the common video 
signal formats and their timing. 


Digital Component Video 
Background 


In digital component video, the video signals 
are in digital form (YCbCr or R’G’B), being 
encoded to composite NTSC, PAL, or SECAM 
only when it is necessary for broadcasting or 
recording purposes. 

The European Broadcasting Union (EBU) 
became interested in a standard for digital 
component video due to the difficulties of 
exchanging video material between the 625- 
line PAL and SECAM systems. The format 
held the promise that the digital video signals 
would be identical whether sourced in a PAL 
or SECAM country, allowing subsequent en- 
coding to the appropriate composite form for 
broadcasting. Consultations with the Society of 


Motion Picture and Television Engineers 
(SMPTE) resulted in the development of an 
approach to support international program 
exchange, including 525-line systems. 

A series of demonstrations was carried out 
to determine the quality and suitability for sig- 
nal processing of various methods. From these 
investigations, the main parameters of the digi- 
tal component coding, filtering, and timing 
were chosen and incorporated into ITU-R 
BT.601. BT.601 has since served as the starting 
point for other digital component video stan- 
dards. 


Coding Ranges 


The selection of the coding ranges balanced 
the requirements of adequate capacity for sig- 
nals beyond the normal range and minimizing 
quantizing distortion. Although the black level 
of avideo signal is reasonably well defined, the 
white level can be subject to variations due to 
video signal and equipment tolerances. Noise, 
gain variations, and transients produced by fil- 
tering can produce signal levels outside the 
nominal ranges. 
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8 or 10 bits per sample are used for each of 
the YCbCr or R’G’B’ components. Although 8- 
bit coding introduces some quantizing distor- 
tion, it was originally felt that most video 
sources contained sufficient noise to mask 
most of the quantizing distortion. However, if 
the video source is virtually noise-free, the 
quantizing distortion is noticeable as contour- 
ing in areas where the signal brightness gradu- 
ally changes. In addition, at least two additional 
bits of fractional YCbCr or R’G’B’ data were 
desirable to reduce rounding effects when 
transmitting between equipment in the studio 
editing environment. For these reasons, most 
pro-video equipment uses 10-bit YCbCr or 
R’G'B’, allowing 2 bits of fractional YCbCr or 
R’G'B’ data to be maintained. 

Initial proposals had equal coding ranges 
for all three YCbCr components. However, this 
was changed so that Y had a greater margin for 
overloads at the white levels, as white level lim- 
iting is more visible than black. Thus, the nom- 
inal 8-bit Y levels are 16-235, while the nominal 
8-bit CbCr levels are 16-240 (with 128 corre- 
sponding to no color). Occasional excursions 
into the other levels are permissible, but never 
at the 0 and 255 levels. 

For 8-bit systems, the values of 00y and 
FFy are reserved for timing information. For 
10-bit systems, the values of 000y-003y and 
3FCy-3FFy are reserved for timing informa- 
tion, to maintain compatibility with 8-bit sys- 
tems. 

The YCbCr or R’G’B’ levels to generate 
75% color bars are discussed in Chapter 3. Dig- 
ital R’G’B’ signals are defined to have the same 
nominal levels as Y to provide processing mar- 
gin and simplify the digital matrix conversions 
between R’G’B’ and YCbCr. 


BT.601 Sampling Rate Selection 


Line-locked sampling of analog R’G’B’ or YUV 
video signals is done. This technique produces 
a static orthogonal sampling grid in which 
samples on the current scan line fall directly 
beneath those on previous scan lines and 
fields, as shown Figures 3.2 through 3.11. 

Another important feature is that the sam- 
pling is locked in phase so that one sample is 
coincident with the 50% amplitude point of the 
falling edge of analog horizontal sync (Oy). 
This ensures that different sources produce 
samples at nominally the same positions in the 
picture. Making this feature common simpli- 
fies conversion from one standard to another. 

For 525-line and 625-line video systems, 
several Y sampling frequencies were initially 
examined, including four times F,,. However, 
the four-times F,, sampling rates did not sup- 
port the requirement of simplifying interna- 
tional exchange of programs, so they were 
dropped in favor of a single common sampling 
rate. Because the lowest sample rate possible 
(while still supporting quality video) was a 
goal, a 12-MHz sample rate was preferred for a 
long time, but eventually was considered to be 
too close to the Nyquist limit, complicating the 
filtering requirements. When the frequencies 
between 12 MHz and 14.3 MHz were exam- 
ined, it became evident that a 13.5-MHz sample 
rate for Y provided some commonality 
between 525- and 625-line systems. Cb and Cr, 
being color difference signals, do not require 
the same bandwidth as the Y, so may be sam- 
pled at one-half the Y sample rate, or 6.75 
MHz. The accepted notation for a digital com- 
ponent system with sampling frequencies of 
13.5, 6.75, and 6.75 MHz for the luma and color 
difference signals, respectively, is 4:2:2 
(Y:Cb:Cr). 
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With 13.5-MHz sampling, each scan line 
contains 858 samples (525-line systems) or 864 
samples (625-line systems) and consists of a 
digital blanking interval followed by an active 
line period. Both the 525- and 625-line systems 
use 720 samples during the active line period. 
Having a common number of samples for the 
active line period simplifies the design of multi- 
standard equipment and standards conversion. 
With a sample rate of 6.75 MHz for Cb and Cr 
(4:2:2 sampling), each active line period con- 
tains 360 Cr samples and 360 Cb samples. 

With analog systems, problems may arise 
with repeated processing, causing an exten- 
sion of the blanking intervals and softening of 
the blanking edges. Using 720 digital samples 
for the active line period accommodates the 
range of analog blanking tolerances of both the 
525- and 625-line systems. Therefore, repeated 
processing may be done without affecting the 
digital blanking interval. Blanking to define 
the analog picture width need only be done 
once, preferably at the display or conversion to 
composite video. 

Initially, BT:601 supported only 525- and 
625-line interlaced systems with a 4:3 aspect 
ratio (720 x 480 and 720 x 576 active resolu- 
tions). Support for a 16:9 aspect ratio was then 
added (960 x 480 and 960 x 576 active resolu- 
tions) using an 18 MHz sample rate. 


Timing Information 


Instead of the conventional horizontal sync, 
vertical sync, and blank timing control signals, 
H (horizontal blanking), V (vertical blanking), 
and F (field) control signals are used: 


F = “0” for Field 1 F = “1” for Field 2 
V = “1” during vertical blanking 
H = “1” during horizontal blanking 


For progressive video systems, F is always a 
“0” since there is no field information. 


480-Line and 525-Line Video 
Systems 


Interlaced Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 480i (since there are typically 
480 active scan lines per frame and it’s inter- 
laced), the frame rate is usually 29.97 Hz (30/ 
1.001) for compatibility with (M) NTSC timing. 
The analog interface uses 525 lines per frame, 
with active video present on lines 23-262 and 
286-525, as shown in Figure 4.1. 

For the 29.97 Hz frame rate, each scan line 
time (H) is about 63.556 us. Detailed horizon- 
tal timing is dependent on the specific video 
interface used, as discussed in Chapter 5. 


Interlaced Analog Composite Video 


(M) NTSC and (M) PAL are analog composite 
video signals that carry all timing and color 
information within a single signal. Using 525 
total lines per frame, they are commonly 
referred to as 525-line systems. They are dis- 
cussed in detail in Chapter 8. 


Progressive Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 480p (since there are typically 
480 active scan lines per frame and it’s progres- 
sive), the frame rate is usually 59.94 Hz (60/ 
1.001) for easier compatibility with (M) NTSC 
timing. The analog interface uses 525 lines per 
frame, with active video present on lines 45- 
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Figure 4.1. 525-Line Interlaced Vertical Interval Timing. 
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Figure 4.2. 525-Line Progressive Vertical Interval Timing. 
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524, as shown in Figure 4.2. Note that many 
early systems use lines 46-525 for active video. 
For the 59.94 Hz frame rate, each scan line 
time (H) is about 31.778 us. Detailed horizon- 
tal timing is dependent on the specific video 
interface used, as discussed in Chapter 5. 


Interlaced Digital Component Video 


BT.601 and SMPTE 267M specify the repre- 
sentation for 480-line digital R’G’B’ or YCbCr 
interlaced video signals, also referred to as 
480i. Active resolutions defined within BT:601 
and SMPTE 267M, their 1x Y and R’G’B’ sam- 
ple rates (F,), and frame rates, are: 


960 x 480 
720 x 480 


18.0 MHz 
13.5 MHz 


29.97 Hz 
29.97 Hz 


Other common active resolutions, their 1x 
sample rates (F,), and frame rates, are: 


SAMPLE RATE = 13.5 MHZ 


16 SAMPLES 


DIGITAL 
BLANKING 


864x480 16.38MHz 29.97 Hz 
704x480 13.50MHz 29.97 Hz 
640x480 12.27MHz 29.97 Hz 
044x480 10.43MHz 29.97 Hz 
528 x 480 9.900 MHz 29.97 Hz 
480x480 9.000MHz 29.97 Hz 
392 x 480 6.750MHz 29.97 Hz 


864 x 480 is a 16:9 square 


pixel format, 


DIGITAL ACTIVE LINE 


while 640 x 480 is a 4:3 square pixel format. 
Although the ideal 16:9 resolution is 854 x 480, 
864 x 480 supports the MPEG 16 x 16 block 
structure. The 704 x 480 format is done by 
using the 720 x 480 format, and blanking the 
first eight and last eight samples each active 
scan line. Example relationships between the 
analog and digital signals are shown in Figures 
4.3 through 4.7. 

The H (horizontal blanking), V (vertical 
blanking), and F (field) signals are as defined 
in Figure 4.8. 


_. 
138 SAMPLES 
(0-137) 


TOTAL LINE 


720 SAMPLES 
(138-857) 


(0-857) 


858 SAMPLES 
57. 


Figure 4.3. 525-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 29.97 Hz Refresh, 13.5 MHz Sample Clock). 
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SAMPLE RATE = 18.0 MHZ 


21.5 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


184 SAMPLES 
(0-183) 


960 SAMPLES 
(184-1143) 


TOTAL LINE 


1144 SAMPLES 
(0-1143) 


Figure 4.4. 525-Line Interlaced Analog - Digital Relationship 
(16:9 Aspect Ratio, 29.97 Hz Refresh, 18 MHz Sample Clock). 


SAMPLE RATE = 12.27 MHZ 


22 SAMPLES 


DIGITAL 
BLANKING DIGITAL ACTIVE LINE 
140 SAMPLES 
(0-139) 


640 SAMPLES 
(140-779) 


TOTAL LINE 


780 SAMPLES 
(0-779) 


Figure 4.5. 525-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 29.97 Hz Refresh, 12.27 MHz Sample Clock). 
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SAMPLE RATE = 10.43 MHZ 


«——=] 20 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


119 SAMPLES 
(0-118) 


544 SAMPLES 
(119-662) 


TOTAL LINE 


663 SAMPLES 
(0-662) 


Figure 4.6. 525-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 29.97 Hz Refresh, 10.43 MHz Sample Clock). 


SAMPLE RATE = 9 MHZ 


10.7 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


92 SAMPLES 
(0-91) 


480 SAMPLES 
(92-571) 


TOTAL LINE 


572 SAMPLES 
(0-571) 


Figure 4.7. 525-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 29.97 Hz Refresh, 9 MHz Sample Clock). 
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Figure 4.8. 525-Line Interlaced Digital Vertical Timing (480 Active Lines). F and V change state at 
the EAV sequence at the beginning of the digital line. Note that the digital line number changes 


LINE 1(V = 1) 


LINE 23 (V = 0) 


LINE 263 (V = 1) 


LINE 286 (V = 0) 


LINE 525 (V = 0) 


LINE F Vv 
NUMBER 
1-3 1 1 
4-22 0 1 
23-262 (0) 0 
263-265 te) 1 
266-285 1 1 
286-525 1 0 


state prior to start of horizontal sync, as shown in Figures 4.3 through 4.7. 
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Progressive Digital Component Video 


BT.1358 and SMPTE 293M specify the repre- 
sentation for 480-line digital R’G’B’ or YCbCr 
progressive video signals, also referred to as 
480p. Active resolutions defined within 
BT.1358 and SMPTE 293M, their 1x sample 
rates (F,), and frame rates, are: 


960 x 480 
720 x 480 


36.0 MHz 
27.0 MHz 


59.94 Hz 
59.94 Hz 


Other common active resolutions, their 1x 
Y and R’GB’ sample rates (F,), and frame 
rates, are: 


864x480 32.75 MHz 59.94 Hz 
704x480 27.00 MHz 59.94 Hz 
640x480 24.54MHz 59.94 Hz 
044x480 20.86MHz 59.94 Hz 
528 x 480 19.80MHz 59.94 Hz 
480x480 18.00MHz 59.94 Hz 
392 x 480 13.50 MHz 59.94 Hz 


SAMPLE RATE = 27.0 MHZ 


——| 16 SAMPLES 


864 x 480 is a 16:9 square pixel format, 
while 640 x 480 is a 4:3 square pixel format. 
Although the ideal 16:9 resolution is 854 x 480, 
864 x 480 supports the MPEG 16 x 16 block 
structure. The 704 x 480 format is done by 
using the 720 x 480 format, and blanking the 
first eight and last eight samples each active 
scan line. Example relationships between the 
analog and digital signals are shown in Figures 
4.9 through 4.12. 

The H (horizontal blanking) and V (verti- 
cal blanking) signals are as defined in Figure 
4.13. 


SIF and QSIF 


SIF is defined to have an active resolution of 
352 x 240. This may be obtained by scaling 
down the 704 x 480 active resolution by a fac- 
tor of two. Square pixel SIF is defined to have 
an active resolution of 320 x 240. 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


720 SAMPLES 


l_. | 
138 SAMPLES 
37) (138-857) 


TOTAL LINE 


858 SAMPLES 
(0-857) 


Figure 4.9. 525-Line Progressive Analog - Digital Relationship 
(4:3 Aspect Ratio, 59.94 Hz Refresh, 27 MHz Sample Clock). 
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SAMPLE RATE = 36.0 MHZ 


21.5 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 
| 
184 SAMPLES 960 SAMPLES 
(0-183) (184-1143) 
TOTAL LINE 


1144 SAMPLES 
(0-1143) 


Figure 4.10. 525-Line Progressive Analog - Digital Relationship 
(16:9 Aspect Ratio, 59.94 Hz Refresh, 36 MHz Sample Clock). 


SAMPLE RATE = 24.54 MHZ 


22 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 
. _ 
140 SAMPLES 640 SAMPLES 
(0-139) (140-779) 


TOTAL LINE 


780 SAMPLES 
(0-779) 


Figure 4.11. 525-Line Progressive Analog - Digital Relationship 
(4:3 Aspect Ratio, 59.94 Hz Refresh, 24.54 MHz Sample Clock). 
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SAMPLE RATE = 20.86 MHZ 


i——>| 20 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


119 SAMPLES 
(0-118) 


544 SAMPLES 
(119-662) 


TOTAL LINE 
663 SAMPLES 
(0-662) 


Figure 4.12. 525-Line Progressive Analog - Digital Relationship 
(4:3 Aspect Ratio, 59.94 Hz Refresh, 20.86 MHz Sample Clock). 


LINE 1 (V = 1) 
BLANKING 
LINE 46 (V = 0) 
LINE F v 
NUMBER 
1-45 0 1 
ACTIVE VIDEO 46-525 0 0 


LINE 525 (V = 0) 


Figure 4.13. 525-Line Progressive Digital Vertical Timing (480 Active Lines). V changes state at 
the EAV sequence at the beginning of the digital line. Note that the digital line number changes 
state prior to start of horizontal sync, as shown in Figures 4.9 through 4.12. 
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QSIF is defined to have an active resolu- 
tion of 176 x 120. This may be obtained by scal- 
ing down the 704 x 480 active resolution by a 
factor of four. Square pixel QSIF is defined to 
have an active resolution of 160 x 120. 


576-Line and 625-Line Video 
Systems 


Interlaced Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 576i (since there are typically 
576 active scan lines per frame and it’s inter- 
laced), the frame rate is usually 25 Hz for com- 
patibility with PAL timing. The analog 
interface uses 625 lines per frame, with active 
video present on lines 23-310 and 336-623, as 
shown in Figure 4.14. 

For the 25 Hz frame rate, each scan line 
time (H) is 64 us. Detailed horizontal timing is 
dependent on the specific video interface used, 
as discussed in Chapter 5. 


Interlaced Analog Composite Video 


(B, D, G, H, I, N, Nc) PAL are analog compos- 
ite video signals that carry all timing and color 
information within a single signal. Using 625 
total lines per frame, they are commonly 
referred to as 625-line systems. They are dis- 
cussed in detail in Chapter 8. 


Progressive Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 576p (since there are typically 
576 active scan lines per frame and it’s progres- 


sive), the frame rate is usually 50 Hz for com- 
patibility with PAL timing. The analog 
interface uses 625 lines per frame, with active 
video present on lines 45-620, as shown in Fig- 
ure 4.15, 

For the 50 Hz frame rate, each scan line 
time (H) is 32 us. Detailed horizontal timing is 
dependent on the specific video interface used, 
as discussed in Chapter 5. 


Interlaced Digital Component Video 


BT.601 specifies the representation for 576-line 
digital R’G’B’ or YCbCr interlaced video sig- 
nals, also referred to as 576i. Active resolutions 
defined within BT.601, their 1x Y and R’G’B’ 
sample rates (F,), and frame rates, are: 


960 x 576 
720 x 576 


18.0 MHz 
13.5 MHz 


25 Hz 
25 Hz 


Other common active resolutions, their 1x 
Y and R’G’B’ sample rates (F,), and frame 
rates, are: 


1024x576 19.67MHz 25 Hz 
768576 = 14.75MHz 25 Hz 
704576 13.550MHz 25 Hz 
044x576 10.43MHz 25Hz 
480x576 9.000MHz 25 Hz 


1024 x 576 is a 16:9 square pixel format, 
while 768 x 576 is a 4:3 square pixel format. 
The 704 x 576 format is done by using the 720 
x 576 format, and blanking the first eight and 
last eight samples each active scan line. Exam- 
ple relationships between the analog and digi- 
tal signals are shown in Figures 4.16 through 
4.19. 

The H (horizontal blanking), V (vertical 
blanking), and F (field) signals are as defined 
in Figure 4.20. 
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OF 
VSYNC. 
| 1 
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Figure 4.14. 625-Line Interlaced Vertical Interval Timing. 
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OF 
VSYNC / 
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Figure 4.15. 625-Line Progressive Vertical Interval Timing. 
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SAMPLE RATE = 13.5 MHZ 


12 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


144 SAMPLES 
(0-143) 


720 SAMPLES 
(144-863) 


TOTAL LINE 


864 SAMPLES 
(0-863) 


Figure 4.16. 625-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 25 Hz Refresh, 13.5 MHz Sample Clock). 


SAMPLE RATE = 18.0 MHZ 


16 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


960 SAMPLES 


192 SAMPLES 
(0-191) (192-1151) 


TOTAL LINE 


1152 SAMPLES 
(0-1151) 


Figure 4.17. 625-Line Interlaced Analog - Digital Relationship 
(16:9 Aspect Ratio, 25 Hz Refresh, 18 MHz Sample Clock). 
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SAMPLE RATE = 14.75 MHZ 


21 SAMPLES 


DIGITAL 

BLANKING DIGITAL ACTIVE LINE 

768 SAMPLES 
(176-943) 


176 SAMPLES 
(0-175) 


TOTAL LINE 


944 SAMPLES 
(0-943) 


Figure 4.18. 625-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 25 Hz Refresh, 14.75 MHz Sample Clock). 


SAMPLE RATE = 10.43 MHZ 


=> 17 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


544 SAMPLES 


124 SAMPLES 
(0-123) (124-667) 


TOTAL LINE 


668 SAMPLES 
(0-667) 


Figure 4.19. 625-Line Interlaced Analog - Digital Relationship 
(4:3 Aspect Ratio, 25 Hz Refresh, 10.43 MHz Sample Clock). 
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LINE 1 = LINE 1(V = 1) 
BLANKING 
LINE 23 (V = 0) 
FIELD 1 
F=0 
LINE F v 
NEN FIELD 1 Sines 
ACTIVE VIDEO 
1-22 0 1 
23-310 ) 0 
311-312 0 1 
Y LINE 311 (V = 1) 313-335 1 1 
LINE 313 = 336-623 1 0 
BLANKING ee : ; 
LINE 336 (V = 0) 
FIELD 2 
(F = 1) 
ODD FIELD 2 
ACTIVE VIDEO 
LINE 624 (V = 1) 
Y BLANKING 
LINE 625 1 LINE 625 (V = 1) 
H=1 
EAV 
H=0 


Figure 4.20. 625-Line Interlaced Digital Vertical Timing (576 Active Lines). F and V change state 
at the EAV sequence at the beginning of the digital line. Note that the digital line number changes 
state prior to start of horizontal sync, as shown in Figures 4.16 through 4.19. 
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Progressive Digital Component Video 


BT.1358 specifies the representation for 576- 
line digital R’G’B’ or YCbCr progressive sig- 
nals, also referred to as 576p. Active resolu- 
tions defined within BT:1358, their 1x Y and 
R’G’B’ sample rates (F,), and frame rates, are: 


960 x 576 
720 x 576 


36.0 MHz 
27.0 MHz 


50 Hz 
50 Hz 


Other common active resolutions, their 1x 
Y and R’GB’ sample rates (F,), and frame 
rates, are: 


1024x576 39.33 MHz 50Hz 
768x576 29.50MHz 50Hz 
704x576 27.00MHz 50Hz 
044x576 20.86MHz 50Hz 
480x576 18.00MHz 50Hz 


SAMPLE RATE = 27.0 MHZ 


=] 12 SAMPLES 


DIGITAL 
BLANKING 


1024 x 576 is a 16:9 square pixel format, 
while 768 x 576 is a 4:3 square pixel format. 
The 704 x 576 format is done by using the 720 
x 576 format, and blanking the first eight and 
last eight samples each active scan line. Exam- 
ple relationships between the analog and digi- 
tal signals are shown in Figures 4.21 through 
4.24, 

The H (horizontal blanking) and V (verti- 
cal blanking) signals are as defined Figure 
4.25. 


DIGITAL ACTIVE LINE 


I~ 
144 SAMPLES 
(0-143) 


720 SAMPLES 
(144-863) 


TOTAL LINE 


864 SAMPLES 
(0-863) 


Figure 4.21. 625-Line Progressive Analog - Digital Relationship 
(4:3 Aspect Ratio, 50 Hz Refresh, 27 MHz Sample Clock). 
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SAMPLE RATE = 36.0 MHZ 


16 SAMPLES 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 


192 SAMPLES 
(0-191) 


960 SAMPLES 
(192-1151) 


TOTAL LINE 


1152 SAMPLES 
(0-1151) 


Figure 4.22. 625-Line Progressive Analog - Digital Relationship 
(16:9 Aspect Ratio, 50 Hz Refresh, 36 MHz Sample Clock). 


SAMPLE RATE = 29.5 MHZ 


21 SAMPLES 


DIGITAL 
BLANKING 


176 SAMPLES 
(0-175) 


DIGITAL ACTIVE LINE 


768 SAMPLES 
(176-943) 


TOTAL LINE 


944 SAMPLES 
(0-943) 


Figure 4.23. 625-Line Progressive Analog - Digital Relationship 
(4:3 Aspect Ratio, 50 Hz Refresh, 29.5 MHz Sample Clock). 
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SAMPLE RATE = 20.86 MHZ 


17 SAMPLES 


DIGITAL 
BLANKING 


DIGITAL ACTIVE LINE 


124 SAMPLES 
(0-123) 


Figure 4.24. 625-Line Progressive Analog - Digital Relationship 
(4:3 Aspect Ratio, 50 Hz Refresh, 20.86 MHz Sample Clock). 


544 SAMPLES 
(124-667) 


TOTAL LINE 


668 SAMPLES 
(0-667) 


LINE 1 (V = 1) 


BLANKING 


ACTIVE VIDEO 


LINE 45 (V = 0) 


BLANKING 


LINE 621 (V = 1) 


H =0 
SAV 


LINE 625 (V = 1) 


LINE 
NUMBER 


1-44 
45-620 
621-625 


Figure 4.25. 625-Line Progressive Digital Vertical Timing (576 Active Lines). V changes state at 
the EAV sequence at the beginning of the digital line. Note that the digital line number changes 


state prior to start of horizontal sync, as shown in Figures 4.21 through 4.24. 
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720-Line and 750-Line Video 
Systems 


Progressive Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 720p (since there are typically 
720 active scan lines per frame and it’s progres- 
sive), the frame rate is usually 59.94 Hz (60/ 
1.001) to simplify the generation of (M) NTSC 
video. The analog interface uses 750 lines per 
frame, with active video present on lines 26- 
745, as shown in Figure 4.26. 

For the 59.94 Hz frame rate, each scan line 
time (H) is about 22.24 us. Detailed horizontal 
timing is dependent on the specific video inter- 
face used, as discussed in Chapter 5. 


Progressive Digital Component Video 


SMPTE 296M specifies the representation for 
720-line digital R’G’B’ or YCbCr progressive 
signals, also referred to as 720p. Active resolu- 
tions defined within SMPTE 296M, their 1x Y 
and R’G’B’ sample rates (F,), and frame rates, 
are: 


1280x720 74.176 MHz 23.976 Hz 
1280x720 74.250 MHz 24.000 Hz 
1280x720 74.250 MHz 25.000 Hz 
1280x720 74.176 MHz 29.970 Hz 
1280 x 720 74.250 MHz 30.000 Hz 
1280 x 720 74.250 MHz 50.000 Hz 
1280x720 74.176 MHz 59.940 Hz 
1280 x 720 74.250 MHz 60.000 Hz 


Note that square pixels and a 16:9 aspect 
ratio are used. Example relationships between 
the analog and digital signals are shown in Fig- 
ures 4.27 and 4.28, and Table 4.1. The H (hori- 
zontal blanking) and V (vertical blanking) 
signals are as defined in Figure 4.29. 


/\ 


Alt 


744 745 746 750 1 2 


6 7 8 9 26 


Figure 4.26. 750-Line Progressive Vertical Interval Timing. 
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SAMPLE RATE = 74.176 OR 74.25 MHZ 


114 SAMPLES 
—____» 


DIGITAL 


BLANKING DIGITAL ACTIVE LINE 
= al -_ 
370 SAMPLES 1280 SAMPLES 
(0-369) (370-1649) 


TOTAL LINE 


1650 SAMPLES 
649) 


Figure 4.27. 750-Line Progressive Analog - Digital Relationship 
(16:9 Aspect Ratio, 59.94 Hz Refresh, 74.176 MHz Sample Clock 
and 60 Hz Refresh, 74.25 MHz Sample Clock). 


[C] SAMPLES 
Lgee eee gs 


DIGITAL 
BLANKING DIGITAL ACTIVE LINE 
mle -| 
[B] SAMPLES 1280 SAMPLES 
TOTAL LINE 
Be a 
[A] SAMPLES 


Figure 4.28. General 750-Line Progressive Analog - Digital Relationship. 
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i 1x Y Total i 
Active Frame Sample Horizontal Horizontal 
Horizontal Rate P i Blanking c 
Récolution (Hz) Rate Resolution (B) 
(MHz) (A) 
24/1.001 74.25/1.001 4125 2845 2589 
24 74.25 4125 2845 2589 
25 74.25 3960 2680 2424 
1280 30/1.001 74.25/1.001 3300 2020 1764 
30 74.25 3300 2020 1764 
50 74.25 1980 700 444 
60/1.001 74.25/1.001 1650 370 114 
60 74.25 1650 370 114 


Table 4.1. Various 750-Line Progressive Analog - Digital 
Parameters for Figure 4.28. 


LINE 1 (V = 1) 
BLANKING 


LINE 26 (V = 0) 


LINE F Vv 
NUMBER 

ACTIVE VIDEO 1-25 0 u 
26-745 0 ) 
746-750 0 1 


LINE 746 (V = 1) 


BLANKING 


LINE 750 (V = 1) 


Figure 4.29. 750-Line Progressive Digital Vertical Timing (720 Active Lines). V changes state at 
the EAV sequence at the beginning of the digital line. Note that the digital line number changes 
state prior to start of horizontal sync, as shown in Figures 4.27 and 4.28. 
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1080-Line and 1125-Line 
Video Systems 


Interlaced Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 1080i (since there are typically 
1080 active scan lines per frame and it’s inter- 
laced), the frame rate is usually 25 or 29.97 Hz 
(30/1.001) to simplify the generation of (B, D, 
G, H, I) PAL or (M) NTSC video. The analog 
interface uses 1125 lines per frame, with active 
video present on lines 21-560 and 584-1123, as 
shown in Figure 4.30. 

MPEG 2 systems use 1088 lines, rather 
than 1080, in order to have a multiple of 32 
scan lines per frame. In this case, an additional 
4 lines per field after the active video are used. 

For the 25 Hz frame rate, each scan line 
time is about 35.56 us. For the 29.97 Hz frame 
rate, each scan line time is about 29.66 us. 
Detailed horizontal timing is dependent on the 
specific video interface used, as discussed in 
Chapter 5. 


Progressive Analog Component Video 


Analog component signals are comprised of 
three signals, analog R’G’B’ or YPbPr. 
Referred to as 1080p (since there are typically 
1080 active scan lines per frame and it’s pro- 
gressive), the frame rate is usually 50 or 59.94 
Hz (60/1.001) to simplify the generation of (B, 
D, G, H, I) PAL or (M) NTSC video. The ana- 
log interface uses 1125 lines per frame, with 
active video present on lines 42-1121, as 
shown in Figure 4.31. 

MPEG 2 systems use 1088 lines, rather 
than 1080, in order to have a multiple of 16 


scan lines per frame. In this case, an additional 
8 lines per frame after the active video are 
used. 

For the 50 Hz frame rate, each scan line 
time is about 17.78 us. For the 59.94 Hz frame 
rate, each scan line time is about 14.83 us. 
Detailed horizontal timing is dependent on the 
specific video interface used, as discussed in 
Chapter 5. 


Interlaced Digital Component Video 


ITU-R BT:709 and SMPTE 274M specify the 
digital component format for the 1080-line digi- 
tal R’G’B’ or YCbCr interlaced signal, also 
referred to as 1080i. Active resolutions defined 
within BT:709 and SMPTE 274M, their 1x Y 
and R’G’B’ sample rates (F,), and frame rates, 
are: 


1920 x 1080 74.250 MHz 25.00 Hz 
1920 x 1080 74.176 MHz 29.97 Hz 
1920 x 1080 74.250 MHz 30.00 Hz 


Note that square pixels and a 16:9 aspect 
ratio are used. Other common active resolu- 
tions, their 1x Y and R’G’B’ sample rates (F,), 
and frame rates, are: 


1280 x 1080 
1280 x 1080 
1280 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 


49.500 MHz 
49.451 MHz 
49.500 MHz 
50.688 MHz 
59.632 MHz 
50.688 MHz 


25.00 Hz 
29.97 Hz 
30.00 Hz 
25.00 Hz 
29.97 Hz 
30.00 Hz 


Example relationships between the analog 
and digital signals are shown in Figures 4.32 
and 4.33, and Table 4.2. The H (horizontal 
blanking) and V (vertical blanking) signals are 
as defined in Figure 4.34. 
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Figure 4.30. 1125-Line Interlaced Vertical Interval Timing. 


IN h 
Aa 
H 


Figure 4.31. 1125-Line Progressive Vertical Interval Timing. 
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SAMPLE RATE = 74.25 OR 74.176 MHZ 


88 SAMPLES 
—$__» 


DIGITAL 
BLANKING DIGITAL ACTIVE LINE 
Ln whe 
280 SAMPLES 1920 SAMPLES 
(0-279) (280-2199) 


TOTAL LINE 


2200 SAMPLES 
(0-2199) 


Figure 4.32. 1125-Line Interlaced Analog - Digital Relationship 
(16:9 Aspect Ratio, 29.97 Hz Refresh, 74.176 MHz Sample Clock 
and 30 Hz Refresh, 74.25 MHz Sample Clock). 


[D] SAMPLES 
Banihal 
DIGITAL 
BLANKING DIGITAL ACTIVE LINE 
ra «| 
[C] SAMPLES [A] SAMPLES 
TOTAL LINE 
be -| 
[B] SAMPLES 


Figure 4.33. General 1125-Line Interlaced Analog - Digital Relationship. 
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Active 


1x Y 


Total 


Horizontal Frame Sample Horizontal Horiontal 
Resolution Rate Rate Resolution Blankine , 
(A) mm (MHz) (B) a 
24/1.001 74.25/1.001 2750 830 638 
24 74.25 2750 830 638 
1920 25 74.25 2640 720 528 
30/1.001 74.25/1.001 2200 280 88 
30 74.25 2200 280 88 
24/1.001 55.6875/1.001 2062.5 622.5 478.5 
24 55.6875 2062.5 622.5 478.5 
1440 25 55.6875 1980 540 396 
30/1.001 55.6875/1.001 1650 210 66 
30 55.6875 1650 210 66 
24/1.001 49.5/1.001 1833.3 553.3 425.3 
24 49.5 1833.3 553.3 425.3 
1280 25 49.5 1760 480 352 
30/1.001 49.5/1.001 1466.7 186.7 58.7 
30 49.5 1466.7 186.7 58.7 


Table 4.2. Various 1125-Line Interlaced Analog - Digital 


Parameters for Figure 4.33. 
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LINE 1 = LINE 1(V = 1) 
BLANKING 
LINE 21 (V = 0) 
FIELD 1 
* O00 LINE F Vv 
ope FIELD 1 eee 
ACTIVE VIDEO 
1-20 0 1 
21-560 0 0 
561-562 ) 1 
Y LINE 561 (V = 1) 563-583 1 1 
LINE 583 —— 584-1123 1 0 
oe 1124-1125 1 1 
LINE 584 (V = 0) 
FIELD 2 
(F = 1) 
FIELD 2 
EVEN 
ACTIVE VIDEO 
LINE 1124 (V = 1) 
1 BLANKING 
LINE 1125 LINE 1125 (V = 1) 


Figure 4.34. 1125-Line Interlaced Digital Vertical Timing (1080 Active Lines). F and V change 
state at the EAV sequence at the beginning of the digital line. Note that the digital line number 
changes state prior to start of horizontal sync, as shown in Figures 4.32 and 4.33. 
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Progressive Digital Component Video 


ITU-R BT:709 and SMPTE 274M specify the 
digital component format for the 1080-line digi- 
tal R’G’B’ or YCbCr progressive signal, also 
referred to as 1080p. Active resolutions 
defined within BT.709 and SMPTE 274M, their 
1x Y and R’G’B’ sample rates (F,), and frame 
rates, are: 


1920 x 1080 
1920 x 1080 
1920 x 1080 
1920 x 1080 
1920 x 1080 
1920 x 1080 
1920 x 1080 
1920 x 1080 


74.176 MHz 23.976 Hz 
74.250 MHz 24.000 Hz 
74.250 MHz 25.000 Hz 
74.176 MHz 29.970 Hz 
74.250 MHz 30.000 Hz 
148.50 MHz 50.000 Hz 
148.35 MHz 59.940 Hz 
148.50 MHz 60.000 Hz 


Note that square pixels and a 16:9 aspect 
ratio are used. Other common active resolu- 
tions, their 1x Y and R’G’B’ sample rates (F,), 
and frame rates, are: 


1280 x 1080 
1280 x 1080 
1280 x 1080 
1280 x 1080 
1280 x 1080 
1280 x 1080 
1280 x 1080 
1280 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 
1440 x 1080 


49.451 MHz 
49.500 MHz 
49.500 MHz 
49.451 MHz 
49.500 MHz 
99.000 MHz 
98.901 MHz 
99.000 MHz 
59.632 MHz 
50.688 MHz 
50.688 MHz 
59.632 MHz 
59.688 MHz 
111.38 MHz 
111.26 MHz 
111.38 MHz 


23.976 Hz 
24.000 Hz 
25.000 Hz 
29.970 Hz 
30.000 Hz 
50.000 Hz 
59.940 Hz 
60.000 Hz 
23.976 Hz 
24.000 Hz 
25.000 Hz 
29.970 Hz 
30.000 Hz 
50.000 Hz 
59.940 Hz 
60.000 Hz 


Example relationships between the analog 
and digital signals are shown in Figures 4.35 
and 4.36, and Table 4.3. The H (horizontal 
blanking) and V (vertical blanking) bits are as 
defined in Figures 4.37. 


Computer Video Timing 


The Video Electronics Standards Association 
(VESA) defines the timing for progressive ana- 
log R’G’B’ signals that drive computer moni- 
tors. Some consumer products are capable of 
accepting these progressive analog R’G’B’ sig- 
nals and displaying them. Common active reso- 
lutions and their names are: 


640 x 400 VGA 
640 x 480 VGA 
854 x 480 SVGA 
800 x 600 SVGA 
1024 x 768 XGA 
1280 x 768 XGA 
1280 x 1024 SXGA 
1600 x 1200 UXGA 


Common refresh rates are 60, 72, 75 and 
85 Hz, although rates of 50-200 Hz may be sup- 
ported. 

Graphics controllers are usually very flexi- 
ble in programmability, allowing trading off 
resolution versus bits per pixel versus refresh 
rate. As a result, a large number of display 
combinations are possible. 
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SAMPLE RATE = 148.5 OR 148.35 MHZ 


88 SAMPLES 
—___» 


DIGITAL 
BLANKING DIGITAL ACTIVE LINE 
| al | 
280 SAMPLES 1920 SAMPLES 
(0-279) (280-2199) 


TOTAL LINE 


2200 SAMPLES 
99) 


Figure 4.35. 1125-Line Progressive Analog - Digital Relationship 
(16:9 Aspect Ratio, 59.94 Hz Refresh, 148.35 MHz Sample Clock 
and 60 Hz Refresh, 148.5 MHz Sample Clock). 
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Figure 4.36. General 1125-Line Progressive Analog - Digital Relationship. 


64 Chapter 4: 


Video Signals Overview 


Rete Frame cas Jotal Horizontal 
Horizontal Sample Horizontal j 
Resolution Rate Rate Resolution Blankine , 
(A) mm (MHz) (B) a 

24/1.001 74.25/1.001 2750 830 638 
24 74.25 2750 830 638 
25 74.25 2640 720 528 

1920 30/1.001 74.25/1.001 2200 280 88 
30 74.25 2200 280 88 
50 148.5 2640 720 528 
60/1.001 148.5/1.001 2200 280 88 
60 148.5 2200 280 88 
24/1.001 55.6875/1.001 2062.5 622.5 478.5 
24 55.6875 2062.5 622.5 478.5 
25 55.6875 1980 540 396 

1440 30/1.001 55.6875/1.001 1650 210 66 
30 55.6875 1650 210 66 
50 111.375 1980 540 396 
60/1.001 111.375/1.001 1650 210 66 
60 111.375 1650 210 66 
24/1.001 49.5/1.001 1833.3 553.3 425.3 
24 49.5 1833.3 593.3 425.3 
25 49.5 1760 480 352 

1280 30/1.001 49.5/1.001 1466.7 186.7 58.7 
30 49.5 1466.7 186.7 58.7 
50 99 1760 480 352 
60/1.001 99/1.001 1466.7 186.7 58.7 
60 99 1466.7 186.7 58.7 


Table 4.3. Various 1125-Line Progressive Analog - Digital 
Parameters for Figure 4.36. 
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BLANKING 


BLANKING 


H=0 
SAV 


LINE F Vv 
NUMBER 
ACTIVE VIDEO 1 0 1 
42-1121 0 0 
1122-1125 ) 1 


LINE 1 (V = 1) 


LINE 42 (V = 0) 


LINE 1122 (V = 1) 
LINE 1125 (V = 1) 


Figure 4.37. 1125-Line Progressive Digital Vertical Timing (1080 Active Lines). V changes state at 
the EAV sequence at the beginning of the digital line. Note that the digital line number changes 
state prior to start of horizontal sync, as shown in Figures 4.35 and 4.36. 
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Analog Video 
Interfaces 


For years, the primary video signal used by 
the consumer market has been composite 
NTSC or PAL video (Figures 8.2 and 8.13). 
Attempts have been made to support s-video, 
but, until recently, it has been largely limited to 
S-VHS VCRs and high-end televisions. 

With the introduction of DVD players, digi- 
tal settop boxes, and DTV, there has been 
renewed interest in providing high-quality 
video to the consumer market. This equipment 
not only supports very high quality composite 
and s-video signals, but many also allow the 
option of using analog R’G’B’ or YPbPr video. 

Using analog R’G’B’ or YPbPr video elimi- 
nates NTSC/PAL encoding and decoding arti- 
facts. As a result, the picture is sharper and has 
less noise. More color bandwidth is also avail- 
able, increasing the horizontal detail. 
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S-Video Interface 


The RCA phono connector (consumer market) 
or BNC connector (pro-video market) trans- 
fers a composite NTSC or PAL video signal, 
made by adding the intensity (Y) and color (C) 
video signals together. The television then has 
to separate these Y and C video signals in 
order to display the picture. The problem is 
that the Y/C separation process is never per- 
fect, as discussed in Chapter 9. 

Many video components now support a 4- 
pin s-video connector, illustrated in Figure 5.1 
(the female connector viewpoint). This connec- 
tor keeps the intensity (Y) and color (C) video 
signals separate, eliminating the Y/C separa- 
tion process in the TV. As a result, the picture 
is sharper and has less noise. Figures 9.2 and 
9.3 illustrate the Y signal, and Figures 9.10 and 
9.11 illustrate the C signal. 

VBI (vertical blanking interval) informa- 
tion, such as closed captioning and teletext, is 
present on the Y video signal. 
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ADC offset may be present on the C signal 
to indicate widescreen (16:9) program material 
is present. An offset of 5V indicates a 16:9 
anamorphic (squeezed) image is present. A 
16:9 TV detects the DC offset and expands the 
4:3 image to fill the screen, restoring the cor- 
rect aspect ratio of the program. Some systems 
also use an offset of 2.3V to indicate the pro- 
gram is letterboxed. 

The IEC 60933-5 standard specifies the s- 
video connector, including signal levels. 


Extended S-Video Interface 


The PC market also uses an extended s-video 
interface. This interface has 7 pins, as shown 
in Figure 5.1, and is backwards compatible 
with the 4-pin interface. 

The three additional pins are for an 2C 
interface (SDA bi-directional data pin and SCL 
clock pin) and a +12V power pin. 


7-PIN MINI DIN CONNECTOR 
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SCART Interface 


Most consumer video components in Europe 
support one or two 21-pin SCART connectors 
(also known as Peritel and Euroconnector). 
This connection allows analog R’G’B’ video or 
s-video, composite video, and analog stereo 
audio to be transmitted between equipment 
using a single cable. The composite video sig- 
nal must always be present, as it provides the 
basic video timing for the analog R’G’B’ video 
signals. Note that the 700 mV R’G'B’ signals 
do not have a blanking pedestal or sync infor- 
mation, as illustrated in Figure 5.4. 

VBI information, such as closed captioning 
and teletext, is present on the composite, Y, 
and R’G’B’ video signals. 

There are now several types of SCART 
pinouts, depending on the specific functions 
implemented, as shown in Tables 5.1 through 
5.3. Pinout details are shown in Figure 5.2. 

The IEC 60933-1 and 60933-2 standards 
specify the basic SCART connector, including 
signal levels. 


4-PIN MINI DIN CONNECTOR 


Figure 5.1. S-Video Connector and Signal Names. 
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Pin Function Signal Level Impedance 
1 audio right out (or audio mono out) 0.5v rms <1Kohm 
2 audio right in (or audio mono in) 0.5v rms >10K ohm 
3 audio left out (or audio mono out) 0.5v rms <1Kohm 
4 audio ground 
5 blue ground 
6 audio left in (or audio mono in) 0.5v rms > 10K ohm 
7 blue 0.7v 75 ohms 
8 function select 9.5-12V = AV mode > 10K ohm 

5-8V = widescreen mode 
0-2V = TV mode 
9 green ground 
10 data 2 
11 green 0.7v 75 ohms 
12 data 1 
13 red ground 
14 data ground 
15 red 0.7v 75 ohms 
16 RGB control 1-3v = RGB, 75 ohms 
0-0.4v = composite 
17 video ground 
18 RGB control ground 
19 composite video out lv 75 ohms 
20 composite video in lv 75 ohms 
21 safety ground 


Figure 5.2. SCART Connector. 


Table 5.1. SCART Connector Signals (Composite and RGB Video). 


SCART Interface 


Pin Function Signal Level Impedance 
1 audio right out (or audio mono out) 0.5v rms <1K ohm 
2 audio right in (or audio mono in) 0.5v rms >10K ohm 
3 audio left out (or audio mono out) 0.5v rms <1K ohm 
4 audio ground 
5 ground 
6 audio left in (or audio mono in) 0.5v rms > 10K ohm 
7 
8 function select 9.5-12V = AV mode > 10K ohm 

5-8V = widescreen mode 
0-2V = TV mode 
9 ground 

10 data 2 
11 
12 data 1 
13 ground 
14 data ground 
15 
16 
17 video ground 
18 
19 composite video out lv 75 ohms 

20 composite video in lv 75 ohms 

21 safety ground 


Table 5.2. SCART Connector Signals (Composite Video Only). 
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Pin Function Signal Level Impedance 
1 audio right out (or audio mono out) 0.5v rms <1K ohm 
2 audio right in (or audio mono in) 0.5v rms >10K ohm 
3 audio left out (or audio mono out) 0.5v rms <1K ohm 
4 audio ground 
5 ground 
6 audio left in (or audio mono in) 0.5v rms > 10K ohm 
7 composite video in! lv 75 ohms 
8 function select 9.5-12V = AV mode > 10K ohm 

5-8V = widescreen mode 
0-2V = TV mode 
9 ground 

10 data 2 
11 composite video in! lv 75 ohms 
12 data 1 
13 ground 
14 data ground 
15 chrominance video 0.3v burst 75 ohms 
16 
17 video ground 
18 
19 composite video out lv 75 ohms 

20 luminance video lv 75 ohms 

21 safety ground 

Notes: 


1. Japan adds these two composite signals to their implementation. 


Table 5.3. SCART Connector Signals (Composite and S-Video). 
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SDTV RGB Interface 


Some SDTV consumer video equipment sup- 
ports an analog R’G’B’ video interface. Verti- 
cal blanking interval (VBI) information, such 
as closed captioning and teletext, may be 
present on the R’G’B’ video signals. Three sep- 
arate RCA phono connectors (consumer mar- 
ket) or BNC connectors (pro-video and PC 
market) are used. 

The horizontal and vertical video timing 
are dependent on the video standard, as dis- 
cussed in Chapter 4. For sources, the video 
signal at the connector should have a source 
impedance of 75Q +5%. For receivers, video 
inputs should be AC-coupled and have a 75-Q 
+5% input impedance. The three signals must 
be coincident with respect to each other within 
+5 ns. 

Sync information may be present on just 
the green channel, all three channels, as a sep- 
arate composite sync signal, or as separate hor- 
izontal and vertical sync signals. A gamma of 
1/0.45 is used. 


7.5 IRE Blanking Pedestal 


As shown in Figure 5.3, the nominal active 
video amplitude is 714 mV, including a 7.5 +2 
IRE blanking pedestal. A 286 +6 mV composite 
sync signal may be present on just the green 
channel (consumer market), or all three chan- 
nels (pro-video market). DC offsets up to +1V 
may be present. 


Analog R’G’B’ Generation 

Assuming 10-bit D/A converters (DACs) with 
an output range of 0-1.305V, the 10-bit YCbCr 
to R’G’B’ equations are: 


R’ = 0.591 (Yeo, — 64) + 0.810(Cr - 512) 


G’ = 0.591 (Ygq1 - 64) - 0.413(Cr - 512) — 
0.199 (Cb - 512) 


B’ = 0.591 (Ygo1 - 64) + 1.025(Cb - 512) 


R’G’'B’ has a nominal 10-bit range of 0-518 
to match the active video levels used by the 
NTSC/PAL encoder in Chapter 9. Note that 
negative values of R’'G’B’ should be supported 
at this point. 

To implement the 7.5 IRE blanking pedes- 
tal, a value of 42 is added to the digital R’G’B’ 
data during active video. 0 is added during the 
blanking time. 

After the blanking pedestal is added, the 
R’G'B’ data is clamped by a blanking signal 
that has a raised cosine distribution to slow the 
slew rate of the start and end of the video sig- 
nal. For interlaced SDTV systems, blank rise 
and fall times are 140 +20 ns. For progressive 
SDTV systems, blank rise and fall times are 70 
+10 ns. 

Composite sync information may be added 
to the R’G’B’ data after the blank processing 
has been performed. Values of 16 (sync 
present) or 240 (no sync) are assigned. The 
sync rise and fall times should be processed to 
generate a raised cosine distribution (between 
16 and 240) to slow the slew rate of the sync 
signal. For interlaced SDTV systems, sync rise 
and fall times are 140 +20 ns, and horizontal 
sync width at the 50%point is 4.7 +0.1 us. For 
progressive SDTV systems, sync rise and fall 
times are 70 +10 ns, and horizontal syne width 
at the 50%-point is 2.33 +0.05 ys. 

At this point, we have digital R’G’B’ with 
sync and blanking information, as shown in 
Figure 5.3 and Table 5.4. The numbers in 
parentheses in Figure 5.3 indicate the data 
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1.020 V 


0.357 V 
0.306 V 


0.020 V 


1.020 V 


0.357 V 
0.306 V 


— 


WHITE LEVEL (800) 


100 IRE 

BLACK LEVEL (282) 

17.5 IRE 

BLANK LEVEL (240) 
40 IRE 

SYNC LEVEL (16) 

GREEN, BLUE, OR RED CHANNEL, SYNC PRESENT 

WHITE LEVEL (800) 

100 IRE 


BLACK LEVEL (282) 


17.5 IRE 
BLANK LEVEL (240) 


GREEN, BLUE, OR RED CHANNEL, NO SYNC PRESENT 


Figure 5.3. SDTV Analog RGB Levels. 7.5 IRE blanking level. 
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1.020 V 


0.321 V 


0.020 V 


1.020 V 


0.321 V 


100 IRE 


— 


43 IRE 


100 IRE 


WHITE LEVEL (800) 


BLACK / BLANK LEVEL (252) 


SYNC LEVEL (16) 


GREEN, BLUE, OR RED CHANNEL, SYNC PRESENT 


WHITE LEVEL (800) 


BLACK / BLANK LEVEL (252) 


GREEN, BLUE, OR RED CHANNEL, NO SYNC PRESENT 


Figure 5.4. SDTV Analog RGB Levels. 0 IRE blanking level. 


74 ~=Chapter 5: Analog Video Interfaces 


value for a 10-bit DAC with a fullscale output 
value of 1.305V. The digital R’G’B’ data may 
drive three 10-bit DACs that generate a 0- 
1.305V output to generate the analog R’G’B’ 
video signals. 

As the sample-and-hold action of the DAC 
introduces a (sin x)/x characteristic, the video 
data may be digitally filtered by a [(sin x) /x]7! 
filter to compensate. Alternately, as an analog 
lowpass filter is usually present after each 
DAC, the correction may take place in the ana- 
log filter. 


Video 7.5 IRE O IRE 
Level Blanking Pedestal | Blanking Pedestal 
white 800 800 
black 282 252 
blank 240 252 
sync 16 16 


Table 5.4. SDTV 10-Bit R°G°B’ Values. 


Analog R’G’B’ Digitization 

Assuming 10-bit A/D converters (ADCs) with 
an input range of 0-1.305V, the 10-bit R’G’B’ to 
YCbCr equations are: 


Yoo} = 0.506(R’ — 282) + 0.992 (G’ — 282) + 
0.193 (B’ — 282) + 64 


Cb = -0.291(R’ - 282) — 0.573 (G’ — 282) + 
0.864 (B’ — 282) + 512 


Cr = 0.864(R’ - 282) - 0.724(G’ — 282) - 
0.140(B’ — 282) + 512 


R’G'B’ has a nominal 10-bit range of 282- 
800 to match the active video levels used by 
the NTSC/PAL decoder in Chapter 9. Table 5.4 
and Figure 5.3 illustrate the 10-bit R’G’B’ val- 
ues for the white, black, blank, and (optional) 
sync levels. 


O IRE Blanking Pedestal 


As shown in Figure 5.4, the nominal active 
video amplitude is 700 mV, with no blanking 
pedestal. A 300 +6 mV composite sync signal 
may be present on just the green channel (con- 
sumer market), or all three channels (pro- 
video market). DC offsets up to +1V may be 
present. 


Analog R’G’B’ Generation 

Assuming 10-bit DACs with an output range of 
0-1.305V, the 10-bit YCbCr to R’G’B’ equations 
are: 


R’ = 0.625(Y go) — 64) + 0.857 (Cr - 512) 


G’ = 0.625(Ygo1 - 64) - 0.437(Cr - 512) — 
0.210(Cb - 512) 


B’ = 0.625(Yg1 - 64) + 1.084 (Cb - 512) 


R’G'B’ has a nominal 10-bit range of 0-548 
to match the active video levels used by the 
NTSC/PAL encoder in Chapter 9. Note that 
negative values of R’'G’B’ should be supported 
at this point. 

The R’G’B’ data is processed as discussed 
when using a 7.5 IRE blanking pedestal. How- 
ever, no blanking pedestal is added during 
active video, and the sync values are 16-252 
instead of 16-240. 

At this point, we have digital R’G’B’ with 
syne and blanking information, as shown in 
Figure 5.4 and Table 5.4. The numbers in 
parentheses in Figure 5.4 indicate the data 
value for a 10-bit DAC with a full-scale output 
value of 1.305V. The digital R’G’B’ data may 
drive three 10-bit DACs that generate a 0- 
1.305V output to generate the analog R’G’B’ 
video signals. 
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Analog R’G’B’ Digitization 

Assuming 10-bit ADCs with an input range of 
0-1.305V, the 10-bit R’'G’B’ to YCbCr equations 
are: 


Yo01 = 0.478(R’ - 252) + 0.938(G’ - 252) + 
0.182(B’ — 252) + 64 


Cb = -0.275(R’ - 252) - 0.542 (G’ - 252) + 
0.817 (B’ — 252) + 512 


Cr = 0.817(R’ — 252) - 0.685(G’ - 252) - 
0.132 (B’ — 252) + 512 


R’G’'B’ has a nominal 10-bit range of 252- 
800 to match the active video levels used by 
the NTSC/PAL decoder in Chapter 9. Table 5.4 
and Figure 5.4 illustrate the 10-bit R’G’B’ val- 
ues for the white, black, blank, and (optional) 
sync levels. 


HDTV RGB Interface 


Some HDTV consumer video equipment sup- 
ports an analog R’G’B’ video interface. Three 
separate RCA phono connectors (consumer 
market) or BNC connectors (pro-video and PC 
market) are used. 

The horizontal and vertical video timing 
are dependent on the video standard, as dis- 
cussed in Chapter 4. For sources, the video 
signal at the connector should have a source 
impedance of 75Q +5%. For receivers, video 
inputs should be AC-coupled and have a 75-Q 
+5% input impedance. The three signals must 
be coincident with respect to each other within 
+5 ns. 

Sync information may be present on just 
the green channel, all three channels, as a sep- 
arate composite sync signal, or as separate hor- 
izontal and vertical sync signals. A gamma of 
1/0.45 is used. 


As shown in Figure 5.5, the nominal active 
video amplitude is 700 mV, and has no blank- 
ing pedestal. A +300 +6 mV tri-level composite 
sync signal may be present on just the green 
channel (consumer market), or all three chan- 
nels (pro-video market). DC offsets up to +1V 
may be present. 


Analog R°G’B’ Generation 


Assuming 10-bit DACs with an output range of 
0-1.305V, the 10-bit YCbCr to R’G’B’ equations 
are: 


R’ = 0.625 (Y¥799 - 64) + 0.963 (Cr - 512) 


G’ = 0.625 (Y799 - 64) - 0.287(Cr - 512) — 
0.114 (Cb - 512) 


B’ = 0.625(Y799 - 64) + 1.136(Cb — 512) 


R’GB’ has a nominal 10-bit range of 0-548 
to match the active video levels used by the 
NTSC/PAL encoder in Chapter 9. Note that 
negative values of R’'G’B’ should be supported 
at this point. 

The R’G’B’ data is clamped by a blanking 
signal that has a raised cosine distribution to 
slow the slew rate of the start and end of the 
video signal. For 1080-line interlaced and 720- 
line progressive HDTV systems, blank rise and 
fall times are 54 +20 ns. For 1080-line progres- 
sive HDTV systems, blank rise and fall times 
are 27 +10 ns. 

Composite sync information may be added 
to the R’G’B’ data after the blank processing 
has been performed. Values of 16 (sync low), 
488 (high sync), or 252 (no sync) are assigned. 
The sync rise and fall times should be pro- 
cessed to generate a raised cosine distribution 
to slow the slew rate of the sync signal. For 
1080-line interlaced HDTV systems, sync rise 
and fall times are 54 +20 ns, and the horizontal 
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1.020V j WHITE LEVEL (800) 
100 IRE 
0.622 V i SYNC LEVEL (488) 
43 IRE 
0.321 V u | BLACK / BLANK LEVEL (252) 


43 IRE 
0.020 V ' SYNC LEVEL (16) 
GREEN, BLUE, OR RED CHANNEL, SYNC PRESENT 
1.020 V | WHITE LEVEL (800) 
100 IRE 
0.321 V u BLACK / BLANK LEVEL (252) 


GREEN, BLUE, OR RED CHANNEL, NO SYNC PRESENT 


Figure 5.5. HDTV Analog RGB Levels. 0 IRE blanking level. 
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sync low and high widths at the 50%-points are 
593 +40 ns. For 720-line progressive HDTV 
systems, sync rise and fall times are 54 +20 ns, 
and the horizontal sync low and high widths at 
the 50%-points are 539 +40 ns. For 1080-line 
progressive HDTV systems, sync rise and fall 
times are 27 +10 ns, and the horizontal sync 
low and high widths at the 50%-points are 296 
+20 ns 

At this point, we have digital R’G’B’ with 
sync and blanking information, as shown in 
Figure 5.5 and Table 5.5. The numbers in 
parentheses in Figure 5.5 indicate the data 
value for a 10-bit DAC with a full-scale output 
value of 1.305V. The digital R’G’B’ data may 
drive three 10-bit DACs that generate a 0- 
1.305V output to generate the analog R’G’B’ 
video signals. 


Video 0 IRE 
Level Blanking Pedestal 
white 800 
sync - high 488 
black 252 
blank 252 
sync - low 16 


Table 5.5. HDTV 10-Bit R°G°B’ Values. 


Analog R°G'B’ Digitization 
Assuming 10-bit ADCs with an input range of 


0-1.305V, the 10-bit R'G’B’ to YCbCr equations 
are: 


Y79 = 0.341(R’ — 252) + 1.143(G’ - 252) + 
0.115(B’ — 252) + 64 


Cb = -0.188(R’ - 252) - 0.629(G’ - 252) + 
0.817 (B’ — 252) + 512 


Cr = 0.817(R’ - 252) - 0.743 (G’ - 252) - 
0.074(B’ — 252) + 512 


R’GB’ has a nominal 10-bit range of 252- 
800 to match the active video levels used by 
the NTSC/PAL decoder in Chapter 9. Table 5.5 
and Figure 5.5 illustrate the 10-bit R’G’B’ val 
ues for the white, black, blank, and (optional) 
sync levels. 


SDTV YPbPr Interface 


Some SDTV consumer video equipment sup- 
ports an analog YPbPr video interface. Vertical 
blanking interval (VBI) information, such as 
closed captioning and teletext, may be present 
on the Y signal. Three separate RCA phono 
connectors (consumer market) or BNC con- 
nectors (pro-video market) are used. 

The horizontal and vertical video timing 
are dependent on the video standard, as dis- 
cussed in Chapter 4. For sources, the video 
signal at the connector should have a source 
impedance of 75Q +5%. For receivers, video 
inputs should be AC-coupled and have a 75-Q 
+5% input impedance. The three signals must 
be coincident with respect to each other within 
+5 ns. 

For consumer products, composite sync is 
present on only the Y channel. For pro-video 
applications, composite sync is present on all 
three channels. A gamma of 1/0.45 is speci- 
fied. 

As shown in Figures 5.6 and 5.7, the Y sig- 
nal consists of 700 mV of active video (with no 
blanking pedestal). Pb and Pr have a peak-to- 
peak amplitude of 700 mV. A 300 +6 mV com- 
posite sync signal is present on just the Y chan- 
nel (consumer market), or all three channels 
(pro-video market). DC offsets up to +1V may 
be present. The 100% and 75% YPbPr color bar 
values are shown in Tables 5.6 and 5.7. 
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PB OR PR CHANNEL, NO SYNC PRESENT 


Figure 5.6. SDTV Analog YPbPr Levels. Sync ony. 
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Figure 5.7. SDTV Analog YPbPr Levels. Sync on YPbPr. 
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= 
IRE 100 88.6 70.1 58.7 41.3 29.9 11.4 0 
q mV 700 620 491 411 289 209 80 0 
IRE 0 -50 16.9 -33.1 33.1 -16.9 50 0 
EP mV 0 -350 118 -232 232 -118 350 0 
IRE 0 8.1 -50 -41.9 41.9 50 -8.1 0 
al mV 0 57 -350 -293 293 350 -57 0 
Table 5.6. SDTV YPbPr 100% Color Bars. Values are relative to the blanking level. 
2 2 < s g s © rn 

= 
IRE 75 66.5 52.6 44.0 31.0 22.4 8.6 0 
: mV 525 465 368 308 217 157 60 0 
IRE 0 -37.5 12.7 -24.8 24.8 -12.7 37.5 0 
We mV 0 -263 89 -174 174 -89 263 0 
IRE 0 6.1 -37.5 -31.4 31.4 37.5 -6.1 0 
ee mV 0 43 -263 -220 220 263 43 0 


Table 5.7. SDTV YPbPr 75% Color Bars. Values are relative to the blanking level. 
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Analog YPbPr Generation 


Assuming 10-bit DACs with an output range of 
0-1.305V, the 10-bit YCbCr to YPbPr equations 
are: 


Y = 0.625 (Y 601 — 64) 
Pb = 0.612(Cb - 512) 
Pr = 0.612(Cr - 512) 


Y has a nominal 10-bit range of 0-548 to 
match the active video levels used by the 
NTSC/PAL encoder in Chapter 9. Pb and Pr 
have a nominal 10-bit range of 0 to +274. Note 
that negative values of Y should be supported 
at this point. 

The YPbPr data is clamped by a blanking 
signal that has a raised cosine distribution to 
slow the slew rate of the start and end of the 
video signal. For interlaced SDTV systems, 
blank rise and fall times are 140 +20 ns. For 
progressive SDTV systems, blank rise and fall 
times are 70 +10 ns. 

Composite syne information is added to 
the Y data after the blank processing has been 
performed. Values of 16 (sync present) or 252 
(no sync) are assigned. The sync rise and fall 
times should be processed to generate a raised 
cosine distribution (between 16 and 252) to 
slow the slew rate of the sync signal. 

Composite syne information may also be 
added to the PbPr data after the blank process- 
ing has been performed. Values of 276 (sync 
present) or 512 (no sync) are assigned. The 
sync rise and fall times should be processed to 
generate a raised cosine distribution (between 
276 and 512) to slow the slew rate of the sync 
signal. 

For interlaced SDTV systems, sync rise 
and fall times are 140 +20 ns, and horizontal 
sync width at the 50%-point is 4.7 +0.1 us. For 
progressive SDTV systems, sync rise and fall 


times are 70 +10 ns, and horizontal syne width 
at the 50%-point is 2.33 +0.05 us. 

At this point, we have digital YPbPr with 
sync and blanking information, as shown in 
Figures 5.6 and 5.7 and Table 5.8. The num- 
bers in parentheses in Figures 5.6 and 5.7 indi- 
cate the data value for a 10-bit DAC with a full- 
scale output value of 1.305V. The digital YPbPr 
data may drive three 10-bit DACs that generate 
a 0-1.305V output to generate the analog 
YPbPr video signals. 


et Y PbPr 
white 800 512 
black 252 512 
blank 252 512 
sync 16 276 


Table 5.8. SDTV 10-Bit YPbPr Values. 


Analog YPbPr Digitization 


Assuming 10-bit ADCs with an input range of 
0-1.305V, the 10-bit YPbPr to YCbCr equations 
are: 


Yo1 = 1.599(Y - 252) + 64 
Cb = 1.635 (Pb - 512) +512 
Cr = 1.635 (Pr - 512) +512 


Y has a nominal 10-bit range of 252-800 to 
match the active video levels used by the 
NTSC/PAL decoder in Chapter 9. Table 5.8 
and Figures 5.6 and 5.7 illustrate the 10-bit 
YPbPr values for the white, black, blank, and 
(optional) sync levels. 
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HDTV YPbPr Interface 


Some HDTV consumer video equipment sup- 
ports an analog YPbPr video interface. Three 
separate RCA phono connectors (consumer 
market) or BNC connectors (pro-video mar- 
ket) are used. 

The horizontal and vertical video timing 
are dependent on the video standard, as dis- 
cussed in Chapter 4. For sources, the video 
signal at the connector should have a source 
impedance of 75Q +5%. For receivers, video 
inputs should be AC-coupled and have a 75-Q 
+5% input impedance. The three signals must 
be coincident with respect to each other within 
+5 ns. 

For consumer products, composite sync is 
present on only the Y channel. For pro-video 
applications, composite sync is present on all 
three channels. A gamma of 1/0.45 is speci- 
fied. 

As shown in Figures 5.8 and 5.9, the Y sig- 
nal consists of 700 mV of active video (with no 
blanking pedestal). Pb and Pr have a peak-to- 
peak amplitude of 700 mV. A +300 +6 mV com- 
posite sync signal is present on just the Y chan- 
nel (consumer market), or all three channels 
(pro-video market). DC offsets up to +1V may 
be present. The 100% and 75% YPbPr color bar 
values are shown in Tables 5.9 and 5.10. 


Analog YPbPr Generation 


Assuming 10-bit DACs with an output range of 
0-1.305V, the 10-bit YCbCr to YPbPr equations 
are: 

Y = 0.625(Y 799 - 64) 

Pb = 0.612(Cb — 512) 

Pr = 0.612(Cr — 512) 


Y has a nominal 10-bit range of 0-548 to 
match the active video levels used by the 
NTSC/PAL encoder in Chapter 9. Pb and Pr 
have a nominal 10-bit range of 0 to +274. Note 
that negative values of Y should be supported 
at this point. 

The YPbPr data is clamped by a blanking 
signal that has a raised cosine distribution to 
slow the slew rate of the start and end of the 
video signal. For 1080-line interlaced and 720- 
line progressive HDTV systems, blank rise and 
fall times are 54 +20 ns. For 1080-line progres- 
sive HDTV systems, blank rise and fall times 
are 27 +10 ns. 

Composite sync information is added to 
the Y data after the blank processing has been 
performed. Values of 16 (sync low), 488 (high 
sync), or 252 (no sync) are assigned. The sync 
rise and fall times should be processed to gen- 
erate a raised cosine distribution to slow the 
slew rate of the sync signal. 

Composite sync information may be added 
to the PbPr data after the blank processing has 
been performed. Values of 276 (sync low), 748 
(high sync), or 512 (no sync) are assigned. 
The sync rise and fall times should be pro- 
cessed to generate a raised cosine distribution 
to slow the slew rate of the sync signal. 

For 1080-line interlaced HDTV systems, 
sync rise and fall times are 54 +20 ns, and the 
horizontal sync low and high widths at the 50%- 
points are 593 +40 ns. For 720-line progressive 
HDTV systems, sync rise and fall times are 54 
+20 ns, and the horizontal sync low and high 
widths at the 50%-points are 539 +40 ns. For 
1080-line progressive HDTV systems, sync rise 
and fall times are 27 +10 ns, and the horizontal 
sync low and high widths at the 50%-points are 
296 +20 ns. 

At this point, we have digital YPbPr with 
syne and blanking information, as shown in 
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PB OR PR CHANNEL, NO SYNC PRESENT 


Figure 5.8. HDTV Analog YPbPr Levels. Sync ony. 
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1.020 V 


0.622 V 


0.321 V 


0.020 V 


1.003 V 
0.954 V 


0.653 V 


0.352 V 
0.303 V 


— 


100 IRE 


= 


50 IRE 


50 IRE 


WHITE LEVEL (800) 


SYNC LEVEL (488) 
43 IRE 
BLACK/ BLANK LEVEL (252) 
43 IRE 
SYNC LEVEL (16) 
Y CHANNEL, SYNC PRESENT 
PEAK LEVEL (786) 
1 SYNC LEVEL (748) 
43 IRE 
i BLACK/ BLANK LEVEL (512) 
43 IRE 
' SYNC LEVEL (276) 


PEAK LEVEL (238) 


PB OR PR CHANNEL, SYNC PRESENT 


Figure 5.9. HDTV Analog YPbPr Levels. Sync on YPbPr. 
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2 z < 5 € so © x 
= 

IRE 100 92.8 78.7 71.5 28.5 21.3 7.2 0 

bi mV 700 649 Sol 501 199 149 51 0 
IRE 0 -50 11.5 -38.5 38.5 -11.5 50 0 

oe mV 0 -350 80 -270 270 -80 350 0 
IRE 0 4.6 -50 45.4 45.4 50 -4.6 0 

id mV 0 32 -350 -318 318 350 -32 0 


§ 

= 
IRE 75 69.6 59.1 53.6 21.4 15.9 5.4 0 
i mV 525 487 413 375 150 112 38 0 
IRE 0 -37.5 8.6 -28.9 28.9 -8.6 37.5 0 
sg mV 0 -263 60 -202 202 -60 263 0 
IRE 0 3.4 -37.5 -34.1 34.1 37.5 -3.4 0 
a mV 0 24 -263 -238 238 263 -24 0 


Table 5.10. HDTV YPbPr 75% Color Bars. Values are relative to the blanking level. 
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Figures 5.8 and 5.9 and Table 5.11. The num- 
bers in parentheses in Figures 5.8 and 5.9 indi- 
cate the data value for a 10-bit DAC with a full- 
scale output value of 1.305V. The digital YPbPr 
data may drive three 10-bit DACs that generate 
a 0-1.305V output to generate the analog 
YPbPr video signals. 


sae Y PbPr 
white 800 512 
sync - high 488 748 
black 252 512 
blank 252 512 
sync - low 16 276 


Table 5.11. HDTV 10-Bit YPbPr Values. 


Analog YPbPr Digitization 
Assuming 10-bit ADCs with an input range of 
0-1.305V, the 10-bit YPbPr to YCbCr equations 
are: 

Y799 = 1.599(Y — 252) + 64 

Ch = 1.635(Pb — 512) + 512 

Cr = 1.635(Pr — 512) + 512 


Y has a nominal 10-bit range of 252-800 to 
match the active video levels used by the 
NTSC/PAL decoder in Chapter 9. Table 5.11 
and Figures 5.8 and 5.9 illustrate the 10-bit 
YPbPr values for the white, black, blank, and 
(optional) sync levels. 


Other Pro-Video Analog 
Interfaces 


Tables 5.12 and 5.13 list some other common 
component analog video formats. The horizon- 
tal and vertical timing is the same as for 525- 
line (M) NTSC and 625-line (B, D, G, H, D 
PAL. The 100% and 75% color bar values are 
shown in Tables 5.14 through 5.17. The 
SMPTE, EBU N10, 625-line Betacam, and 625- 
line MII values are the same as for SDTV 
YPbPr. 


VGA Interface 


Table 5.18 and Figure 5.10 illustrate the 15-pin 
VGA connector used by computer equipment, 
and some consumer equipment, to transfer 
analog RGB signals. The analog RGB signals 
do not contain sync information and have no 
blanking pedestal, as shown in Figure 5.4. 


VGA Interface 
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Signal 
Format pbs Amplitudes Notes 
8 (volts) 
SMPTE bs S00 0% setup on Y 
EBU N10 sync -0.300 100% saturation 
, , three wire = (Y + sync), (R’-Y), (B’-Y) 
R-Y, B’-Y +0.350 
525-Line = ue 7.5% setup on Y only 
Betacam! sync -0.286 100% saturation 
R-Y, B-Y +0.467 three wire = (Y + sync), (R’-Y), (B’-Y) 
625-Line = am 0% setup on Y 
Betacam! sync -0.300 100% saturation 
R-Y, B-Y +0350 three wire = (Y + sync), (RY), (B’-Y) 
525-Line is im 7.5% setup on Y only 
MII2 syne -0.300 100% saturation 
R-Y, B-Y 40.324 three wire = (Y + sync), (R’-Y), (B’-Y) 
625-Line = bali 0% setup on Y 
MII2 syne -0.300 100% saturation 
R-Y, B-Y +0350 three wire = (Y + sync), (RY), (B’-Y) 
Notes: 


1. Trademark of Sony Corporation. 


2. Trademark of Matsushita Corporation. 


Table 5.12. Common Pro-Video Component Analog Video Formats. 
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Signal 
Format cat Amplitudes Notes 
B (volts) 
, , , 0/ f d +4 
sueret: |= ee ee See ee 
EBU N10 : ; haat 
syne —0.300 three wire = (G’ + sync), B’, R 
, , , Oo - ¥ . 
(setup) —~0.286 < 7 i ‘j 
syne . three wire = (G’ + sync), B’, R 
G’,B’, R’ +0.714 0% setup on G’, B’, and R’ 
Gare ) 100% saturation 
Pp syne —0.286 three wire = (G’ + sync), B’, R’ 
G’,B’, R’ +0.700 7.5% setup on G’, B’, and R’ 
Mir? 100% saturation 
syne —0.300 three wire = (G’ + sync), B’, R’ 
Notes: 


1. Trademark of Matsushita Corporation. 


Table 5.13. Common Pro-Video RGB Analog Video Formats. 
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£ 

= 
TRE 100 89.5 72.3 61.8 45.7 35.2 18.0 7.5 
* mV 714 639 517 441 326 251 129 54 
, TRE 0 -65.3 22.0 —43.3 43.3 -22.0 65.3 0 
ei mV 0 —466 157 -309 309 -157 466 0 
; TRE 0 10.6 -65.3 —94.7 54.7 65.3 -10.6 0 
aa mV 0 76 —466 -391 391 466 -76 0 


Table 5.14. 525-Line Betacam 100% Color Bars. Values are relative to the 


§ 

= 
IRE 76.9 69.0 56.1 48.2 36.2 28.2 15.4 7.5 
m mV 549 492 401 344 258 202 110 54 
: TRE 0 -49.0 16.5 -32.5 32.5 -16.5 49.0 0 
vo mV 0 -350 118 -232 232 -118 350 0 
, TRE 0 8.0 -49.0 -41.0 41.0 49.0 -8.0 0 
= mV 0 57 -350 -293 293 350 -57 0 


Table 5.15. 525-Line Betacam 75% Color Bars 


. Values are relative to the blanking level. 
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BY 
= 
TRE 100 89.5 72.3 61.8 45.7 39.2 18.0 7.5 
” mV 700 626 506 433 320 246 126 53 
, TRE 0 -46.3 15.6 -30.6 30.6 -15.6 46.3 0 
seca mV 0 -324 109 -214 214 -109 324 0 
, IRE 0 7.5 -46.3 -38.7 38.7 46.3 -7.5 0 
aes mV 0 53 -324 -271 271 324 -53 0 
Table 5.16. 525-Line MII 100% Color Bars. Values are relative to the blanking level. 
2] 2:)/s])¢/)]é8l)esi|e|# 
s/s|6¢ || #8) 2] a] g 
= 
IRE 76.9 69.0 56.1 48.2 36.2 28.2 15.4 7.5 
" mV 538 483 393 338 253 198 108 53 
: IRE 0 -34.7 11.7 -23.0 23.0 -11.7 34.7 0 
— mV 0 -243 82 -161 161 -82 243 0 
; TRE 0 5.6 -34.7 -29.0 29.0 34.7 -5.6 0 
ica mV 0 39 -243 -203 203 243 -39 0 


Table 5.17. 525-Line MII 75% Color Bars. Values are relative to the blanking level. 


Figure 5.10. VGA 15-Pin D-SUB Female Connector. 
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Pin Function Signal Level Impedance 
1 red 0.7v 75 ohms 
2 green 0.7v 75 ohms 
3 blue 0.7v 75 ohms 
4 reserved 
5 ground 
6 red ground 
7 green ground 
8 blue ground 
9 +5V DC 

10 sync ground 

11 reserved 

12 DDC SDA > 2.4v 

13 HSYNC (horizontal sync) > 2.4v 

14 VSYNC (vertical sync) > 2.4v 

15 DDC SCL > 2.4v 

Notes: 


1. DDC = Display Data Channel. 


Table 5.18. VGA Connector Signals. 
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Digital Video 
Interfaces 


Pro-Video Component 
Interfaces 


Table 6.1 lists the parallel and serial digital 
interfaces for various pro-video formats. 


Video Timing 


Rather than digitize and transmit the blanking 
intervals, special sequences are inserted into 
the digital video stream to indicate the start of 
active video (SAV) and end of active video 
(EAV). These EAV and SAV sequences indicate 
when horizontal and vertical blanking are 
present and which field is being transmitted. 
They also enable the transmission of ancillary 
data such as digital audio, teletext, captioning, 
etc. during the blanking intervals. 

The EAV and SAV sequences must have 
priority over active video data or ancillary data 
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to ensure that correct video timing is always 
maintained at the receiver. The receiver 
decodes the EAV and SAV sequences to 
recover the video timing. 

The video timing sequence of the encoder 
is controlled by three timing signals discussed 
in Chapter 4: H (horizontal blanking), V (verti- 
cal blanking), and F (Field 1 or Field 2). A 
zero-to-one transition of H triggers an EAV 
sequence while a one-to-zero transition trig- 
gers an SAV sequence. F and V are allowed to 
change only at EAV sequences. 

Usually, both 8-bit and 10-bit interfaces are 
supported, with the 10-bit interface used to 
transmit 2 bits of fractional video data to mini- 
mize cumulative processing errors and to sup- 
port 10-bit ancillary data. 

YCbCr or R’G’B’ data may not use the 10- 
bit values of 000q-003y and 3FCy-3FFy, or 
the 8-bit values of 00,, and FFy, since they are 
used for timing information. 


Pro-Video Component Interfaces 
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Active Total Display | Frame ae ie SDTV Digital Digital 
Resolution | Resolution+ Aspect Rate sai or Parallel Serial 
(H X V) (H X V) Ratio (Hz) (MHz) HDTV Standard Standard 
BT696 BT656 
720 x 480 858 x 5251 4:3 29.97 13.5 SDTV BT:799 BL-799 
SMPTE 125M : 
; BT.1362 
720 x 480 858 x 525p 4:3 59.94 27 SDTV - SMPTE 294M 
: : BT.656 BT.656 
720 x 576 864 x 6251 4:3 25 13.5 SDTV BL-799 BL-799 
720 x 576 864 x 625p 4:3 50 27 SDTV - BT.1362 
BT.1302 
960x480 | 1144x5251 | 169 | 2997 | 18 | SDTV BT.1303 Ste 
SMPTE 267M ° 
‘ : BT.1302 BT.1302 
960 x 576 1152 x 625i 16:9 25 18 SDTV BT1303 BL1303 
1280 x 720 | 1650 x 750p 16:9 59.94 74.176 | HDTV SMPTE 274M - 
1280 x 720 | 1650 x 750p 16:9 60 74.25 HDTV SMPTE 274M - 
; : BT.1120 BT.1120 
1920 x 1080 | 2200 x 1125i 16:9 29.97 74.176 | HDTV SMPTE 274M SMPTE 292M 
; : BT.1120 BT.1120 
1920 x 1080 | 2200 x 1125i 16:9 30 74.25 HDTV SMPTE 274M SMPTE 292M 
. BT.1120 
1920 x 1080 | 2200 x 1125p 16:9 59.94 148.35 | HDTV SMPTE 274M - 
: BT.1120 
1920 x 1080 | 2200 x 1125p 16:9 60 148.5 HDTV SMPTE 274M - 
1920 x 1080 | 2376 x 1250i 16:9 25 74.25 HDTV BT.1120 BT.1120 
1920 x 1080 | 2376 x 1250p 16:9 50 148.5 HDTV BT.1120 - 


Table 6.1. Pro-Video Parallel and Serial Digital Interface Standards for Various 


Component Video Formats. 


44= interlaced, p = progressive. 
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The EAV and SAV sequences are shown in 
Table 6.2. The status word is defined as: 


F = “0” for Field 1 F = “1” for Field 2 
V = “1” during vertical blanking 

H = “0” at SAV H = “1” at EAV 
P3-P0 = protection bits 


P3=VOH 
P2=FOH 
Pl=FOV 
PO=F®VOH 


where © represents the exclusive-OR function. 
These protection bits enable one- and two-bit 
errors to be detected and one-bit errors to be 
corrected at the receiver. 

For 4:2:2 YCbCr data, after each SAV 
sequence, the stream of active data words 
always begins with a Cb sample, as shown in 
Figure 6.1. In the multiplexed sequence, the 
co-sited samples (those that correspond to the 
same point on the picture) are grouped as Cb, 
Y, Cr. During blanking intervals, unless ancil- 
lary data is present, 10-bit Y or R’'G’B’ values 
should be set to 040, and 10-bit CbCr values 
should be set to 2004. 

The receiver detects the EAV and SAV 
sequences by looking for the 8-bit FFy 00q 00y 
preamble. The status word (optionally error 


corrected at the receiver, see Table 6.3) is used 
to recover the H, V, and F timing signals. 


Ancillary Data 


Ancillary data packets are used to transmit 
information (such as digital audio, closed cap- 
tioning, and teletext data) during the blanking 
intervals. ITU-R BT.1364 and SMPTE 291M 
describe the ancillary data formats. 

During horizontal blanking, ancillary data 
may be transmitted in the interval between the 
EAV and SAV sequences. During vertical 
blanking, ancillary data may be transmitted in 
the interval between the SAV and EAV 
sequences. Multiple ancillary packets may be 
present in a horizontal or vertical blanking 
interval, but they must be contiguous with 
each other. Ancillary data should not be 
present where indicated in Table 6.4 since 
these regions may be affected by video switch- 
ing. 
There are two types of ancillary data for- 
mats. The older Type 1 format uses a single 
data ID word to indicate the type of ancillary 
data; the newer Type 2 format uses two words 
for the data ID. The general packet format is 
shown in Table 6.5. 


8-bit Data 10-bit Data 
Be D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
1 1 1 1 1 1 1 1 1 1 
preamble 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 
status word 1 F Vv H P3 P2 Pl PO 0 0 


Table 6.2. EAV and SAV Sequence. 
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1716 (1728) 


pce Received F, V, H (Bits D8-D6) 
000 001 010 O11 100 101 110 111 
0000 000 000 000 . 000 a * 111 
0001 000 is * 111 * 111 111 111 
0010 000 * * 011 * 101 * * 
0011 * is 010 * 100 * * 111 
0100 000 * * 011 * * 110 * 
0101 i 001 id * 100 is i 111 
0110 i 011 011 011 100 * . 011 
0111 100 is if 011 100 100 100 * 
1000 000 i * id 101 110 is 
1001 * 001 010 * * Ld * 111 
1010 * 101 010 * 101 101 * 101 
1011 010 ® 010 010 * 101 010 * 
1100 * 001 110 * 110 * 110 110 
1101 001 001 e 001 i 001 110 ‘si 
1110 i! i . 011 * 101 110 i 
1111 001 010 i 100 bi - ki 
Notes: 
* = uncorrectable error. 
Table 6.3. SAV and EAV Error Correction at Decoder. 
| BT.601 H SIGNAL 
START OF DIGITAL LINE START OF DIGITAL ACTIVE LINE NEXT LINE 
EAV CODE BLANKING SAV CODE CO-SITED CO-SITED 
BT.656 
4:2:2 
VIDEO 


Figure 6.1. BT.656 Parallel Interface Data For One Scan Line. 525-line; 4:2:2 
YCbCr; 720 active samples per line; 27 MHz clock; 10-bit system. The values for 


625-line systems are shown in parentheses. 
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Sampling Rate Video Line Numbers Sample Numbers 
(MHz) Standard Affected Affected 
13.5 525line if a ree 
13.5 625ine : oe, ee 
18 525-line i a iors 
18 625-line 7 oo 1924-2209 
74.25 ges ot 
74.25/1.001 aeons 4 an pater 


Table 6.4. Ancillary Regions Affected by Switching. 


Data ID (DID) 

DID indicates the type of data being sent. The 
assignment of most of the DID values is con- 
trolled by the ITU and SMPTE to ensure 
equipment compatibility. A few DID values are 
available that don’t require registration. Some 
DID values are listed in Table 6.6. 


Secondary ID (SDID, Type 2 Only) 

SDID is also part of the data ID for Type 2 
ancillary formats. The assignment of most of 
the SDID values is also controlled by the ITU 
and SMPTE to ensure equipment compatibil- 
ity. A few SDID values are available that don’t 
require registration. Some SDID values are 
listed in Table 6.6. 


Data Block Number (DBN, Type 1 Only) 

DBN is used to allow multiple ancillary pack- 
ets (sharing the same DID) to be put back 
together at the receiver. This is the case when 
there are more than 255 user data words 
required to be transmitted, thus requiring 


more than one ancillary packet to be used. The 
DBN value increments by one for each consec- 
utive ancillary packet. 


Data Count (DC) 

DC specifies the number of user data words in 
the packet. In 8-bit applications, it specifies the 
six MSBs of an 8-bit value, so the number of 
user data words must be an integral number of 
four. 


User Data Words (UDW) 

Up to 255 user data words may be present in 
the packet. In 8-bit applications, the number of 
user data words must be an integral number of 
four. Padding words may be added to ensure 
an integral number of four user data words are 
present. 

User data may not use the 10-bit values of 
000y-003y and 3FCy-3FFy, or the 8-bit values 
of 00y and FFy, since they are used for timing 
information. 
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8-bit Data 10-bit Data 
ad D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 (0) 0 (0) 0 0 0 0 0 
ancillary data 
flag (ADF) 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
data ID aa even 
(DID) D8 parity Value of 0000 0000 to 1111 1111 
data block Se even 
jiinbes or SDID D8 Ronis Value of 0000 0000 to 1111 1111 
data count aS even 
(DC) D8 parity Value of 0000 0000 to 1111 1111 
user data word 0 Value of 00 0000 0100 to 11 1111 1011 
user data word N Value of 00 0000 0100 to 11 1111 1011 
=S Sum of D0-D8 of data ID through last user data word. 
check sum D8 y ae 
Preset to all zeros; carry is ignored. 


Table 6.5. Ancillary Data Packet General Format. 
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8-bit 8-bit 
DID Function DID Function 
Type 2 Type 1 
004 undefined 80y marked for deletion 
01y-03y reserved 81y-83y reserved 
0414, 084, OCy | 8-bit applications 84y end marker 
104-3F y reserved 85y-BF y reserved 
40y-5Fy user application COy-DF y user application 
60y timecode EOy-EBy | registered 
6ly closed captioning ECy AES control packet, group 4 
62y-7F y registered EDy AES control packet, group 3 
EEq AES control packet, group 2 
EFy AES control packet, group 1 
8-bit F4y error detection 
SDID Function 
Type 2 Foy longitudinal timecode 
004 undefined format F8y AES extended packet, group 4 
x0q 8-bit applications Foy AES audio data, group 4 
x4y 8-bit applications FAyq AES extended packet, group 3 
x8q 8-bit applications FBy AES audio data, group 3 
xCy 8-bit applications FCy AES extended packet, group 2 
all others unassigned FDy AES audio data, group 2 
FEy AES extended packet, group 1 
FFy AES audio data, group 1 


Table 6.6. DID and SDID Assignments. 
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Audio Samples per Frame: 29.97 Hz Video 
Sampling Samples per 
Rate E ti , E Sone: Frame: 25 Hz 
(kHz) Samples per Samples per Samples per ae Nanbee ae Video 
Frame Field 1 Field 2 
Number Samples 
48.0 8008 / 5 1602 1601 - - 1920 
23 1471 
44.1 147147 / 100 1472 1471 47 1471 1764 
71 1471 
4 1068 
32 16016 / 15 1068 1067 8 1068 1280 
12 1068 


Table 6.7. Isochronous Audio Sample Rates. 


Digital Audio Format 

ITU-R BT:1305 and SMPTE 272M describe the 
transmission of digital audio as ancillary data. 
2-16 channels of up to 24-bit digital audio are 
supported, with sample rates of 32-48 kHz. 
Table 6.7 lists the number of audio samples per 
video frame for various audio sample rates. 

Audio data of up to 20 bits per sample is 
transferred using the format in Table 6.8. “V” 
is the AES/EBU sample valid bit, “U” is the 
AES/EBU user bit, and ”C” is the AES/EBU 
audio channel status bit. “P” is an even parity 
bit for the 26 previous bits in the sample 
(excluding D9 in the first and second words of 
the audio sample). Audio is represented as 
two’s complement linear PCM data. 

To support 24-bit audio samples, extended 
data packets may be used to transfer the four 
auxiliary bits of the AES/EBU audio stream. 

Audio data is formatted as 1-4 groups, 
defined by [gr 1] and [gr 0], with each group 
having 1-4 channels of audio data, defined by 
[ch 1] and [ch 0]. 


Optional control packets may be used on 
lines 12 and 275 (525-line systems) or lines 8 
and 320 (625-line systems) to specify the sam- 
ple rate, delay relative to the video, etc. If 
present, it must be transmitted prior to any 
audio packets. If not transmitted, a default con- 
dition of 48 kHz isochronous audio is assumed. 


Timecode Format 

ITU-R BT.1366 defines the transmission of 
timecode using ancillary data for 525-line, 625- 
line, and 1125-line systems. The ancillary 
packet format is shown in Table 6.9, and is 
used to convey longitudinal (LTC) or vertical 
interval timecode (VITC) information. For 
additional information on the timecode format, 
and the meaning of the flags in Table 6.9, see 
the timecode discussion in Chapter 8. 


Binary Bit Group 1 

The eight bits that comprise binary bit group 1 
(DBB10-DBB17) specify the type of timecode 
and user data, as shown in Table 6.10. 
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Preset to all zeros; carry is ignored. 


8-bit Data 10-bit Data 
pe D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 0 0 0 0 0 0 0 0 
ancillary data 
flag (ADF) 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
data ID a0 even 
(DID) D8 parity 1 1 1 1 1 grl gr0 1 
data block no even 
number (DBN) D8 aay Value of 0000 0000 to 1111 1111 
data count aS even 
(DC) D8 batty Value of 0000 0000 to 1111 1111 
D8 AS A4 A3 A2 Al AO chl ch0 Z 
audio sample 0 D8 Al4 Al13 Al2 All A10 AQ A8 AZ A6 
D8 P Cc U V Al19 A18 A17 Al6 Al5 
D8 AS A4 A3 A2 Al AO ch1l ch0 Z 
audio sample N D8 Al4 Al13 Al2 All Al0 AQ A8& AZ A6 
D8 P Cc U V Al19 A18 A17 Al6 Al5 
Fee a Ds Sum of D0-D8 of data ID through last audio sample word. 


Table 6.8. Digital Audio Ancillary Data Packet Format. 
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8-bit Data 10-bit Data 
oS D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 0 0 0 0 0 0 0 0 
neapn 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
data ID (DID) D8 EP 0 1 1 0 0 0 0 0 
SDID D8 EP 0 1 1 0 0 0 0 0 
data count (DC) D8 EP 0 0 0 1 0 0 0 0 
D8 EP units of frames DBB10 0 0 0 
D8 EP user group 1 DBB11 0 0 0 
D8 EP flag 2 flag 1 tens of frames DBB12 0 0 0 
D8 EP user group 2 DBB13 0 0 0 
D8 EP units of seconds DBB14 0 0 0 
D8 EP user group 3 DBB15 0 0 0 
D8 EP flag 3 tens of seconds DBB16 0 0 0 
D8 EP user group 4 DBB17 0 0 0 
timecode data —_ 
D8 EP units of minutes DBB20 0 0 0 
D8 EP user group 5 DBB21 0 0 0 
D8 EP flag 4 tens of minutes DBB22 0 0 0 
D8 EP user group 6 DBB23 0 0 0 
D8 EP units of hours DBB24 0 0 0 
D8 EP user group 7 DBB25 0 0 0 
D8 EP flag 6 flag 5 tens of hours DBB26 0 0 0 
D8 EP user group 8 DBB27 0 0 0 
eae want Ds Sum of ee of data ID through last timecode data word. 
reset to all zeros; carry is ignored. 


Notes: 
EP = even parity for DO-D7. 


Table 6.9. Timecode Ancillary Data Packet Format. 
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DBB17 | DBB16 | DBB15 | DBB14 | DBB13 | DBB12 | DBB11 | DBB10 Definition 
0 0 0 0 0 0 0 0 LTC 
0 0 0 0 0 0 0 1 Field 1 VITC 
0 0 0 0 0 0 1 0 Field 2 VITC 
0 0 0 0 0 0 1 1 
user defined 
0 0 0 0 0 1 1 1 
0 0 0 0 1 0 0 0 
locally generated time 
address and user data 
0 1 1 1 1 1 1 1 
1 0 0 0 0 0 0 0 
reserved 
1 1 1 1 1 1 1 1 


Table 6.10. Binary Bit Group 1 Definitions for 525-Line and 625-Line Systems. 


Binary Bit Group 2 
The eight bits that comprise binary bit group 2 
(DBB20-DBB27) specify line numbering and 
status information. 

DBB20-DBB24 specify the VITC line 
select as shown in Table 6.11. These convey 
the VITC line number location. 

If DBB25 is a “1,” when the timecode infor- 
mation is converted into an analog VITC signal 
on line N, it must also be repeated on line N + 
2. 


If DBB26 is a “1,” a timecode error was 
received, and the transmitted timecode has 
been interpolated from a previous timecode. 


If DBB27 is a “0,” the user group bits are 
processed to compensate for any latency. If a 
“1,” the user bits are retransmitted with no 
delay compensation. 


User Group Bits 

32 bits of user data may be transferred with 
each timecode packet. User data is organized 
as eight groups of four bits each, with the D7 
bit being the MSB. For additional information 
on user bits, see the timecode discussion in 
Chapter 8. 
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525-Line 625-Line 
Interlaced Systems Interlaced Systems 
DBB24 | DBB23 | DBB22 | DBB21 | DBB20 VITC on VITC on VITC on VITC on 
Line N Line N + 2 Line N Line N + 2 
0 0 1 1 0 - - 6, 319 8, 321 
0 0 1 1 1 - - 7, 320 9, 322 
0 1 0 0 0 - - 8, 321 10, 323 
0 1 0 0 1 - - 9, 322 11, 324 
0 1 0 1 0 10, 273 12, 275 10, 323 12, 325 
0 1 0 1 1 11, 274 13, 276 11, 324 13, 326 
0 1 1 0 0 12. 275 14, 277 12, 325 14, 327 
0 1 1 0 1 13, 276 15, 278 13, 326 15, 328 
0 1 1 1 0 14, 277 16, 279 14, 327 16, 329 
0 1 1 1 1 15, 278 17, 280 15, 328 17, 330 
1 0 0 0 0 16, 279 18, 281 16, 329 18, 331 
1 0 0 0 1 17, 280 19, 282 17, 330 19, 332 
1 0 0 1 0 18, 281 20, 283 18, 331 20, 333 
1 0 0 1 1 19, 282 - 19, 332 21, 334 
1 0 1 0 0 20, 283 - 20, 333 22, 335 
1 0 1 0 1 - - 21, 334 - 
1 0 1 1 0 - - 22, 335 - 


Table 6.11. VITC Line Select Definitions for 525-Line and 625-Line Systems. 
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SMPTE 266M 

SMPTE 266M also defines a digital vertical 
interval timecode (DVITC) for 525-line video 
systems. It is an 8-bit digital representation of 
the analog VITC signal, transferred using the 8 
MSBs. If the VITC is present, it is carried on 
the Y data channel in the active portion of lines 
14 and 277. The 90 bits of VITC information 
are carried by 675 consecutive Y samples. A 
10-bit value of 040y represents a “0;” a 10-bit 
value of 300y represents a “1.” Unused Y sam- 
ples have a value of 040. 


EIA-608 Closed Captioning Format 
SMPTE 334M defines the ancillary packet for- 
mat for closed captioning, as shown in Table 
6.12. 

The field bit is a “0” for Field 2 and a “1” for 
Field 1. 

The offset value is a 5-bit unsigned integer 
which represents the offset (in lines) of the 
data insertion line, relative to line 9 or 272 for 
525-line systems and line 5 or 318 for 625-line 
systems. 


EIA-708 Closed Captioning Format 
SMPTE 334M also defines the ancillary packet 
format for digital closed captioning, as shown 
in Table 6.13. 

The payload is the EIA-708 caption distri- 
bution packet (CDP), which has a variable 
length. 


Error Detection Checksum Format 

ITU-R BT:1304 defines a checksum for error 
detection. The ancillary packet format is 
shown in Table 6.14. 

For 13.5 MHz 525-line systems, the ancil- 
lary packet occupies sample words 1689-1711 
on lines 9 and 272. For 13.5 MHz 625-line sys- 
tems, the ancillary packet occupies sample 
words 1701-1723 on lines 5 and 318. Note that 


these locations are immediately prior to the 
SAV code words. 


Checksums 

Two checksums are provided: one for a field of 
active video data and one for a full field of data. 
Each checksum is a 16-bit value calculated as 
follows: 


CRC =x!6 4x24 x54 x! 


For the active CRC, the starting and end- 
ing samples for 13.5 MHz 525-line systems are 
sample word 0 on lines 21 and 284 (start) and 
sample word 1439 on lines 262 and 525 (end). 
The starting and ending samples for 13.5 MHz 
625-line systems are sample word 0 on lines 24 
and 336 (start) and sample word 1439 on lines 
310 and 622 (end). 

For the field CRC, the starting and ending 
samples for 13.5 MHz 525-line systems are 
sample word 1444 on lines 12 and 275 (start) 
and sample word 1439 on lines 8 and 271 
(end). The starting and ending samples for 
13.5 MHz 625-line systems are sample word 
1444 on lines 8 and 321 (start) and sample 
word 1439 on lines 4 and 317 (end). 


Error Flags 
Error flags indicate the status of the previous 


field. 


edh (error detected here): A “1” indicates 
that a transmission error was detected since 
one or more ancillary packets did not match 
its checksum. 


eda (error detected already): A “1” indicates 
a transmission error was detected at a prior 
point in the data path. A device that receives 
data with this flag set should forward the 
data with the flag set and the edh flag reset 
to “0” if no further errors are detected. 
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8-bit Data 10-bit Data 
Sid D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 

0 0 0 0 0 0 0 0 0 0 
Sa ae 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 
data ID (DID) Ds EP 0 1 1 0 0 0 0 1 
SDID D8 EP 0 0 0 0 0 0 1 0 
data count (DC) D8 EP 0 0 0 0 0) 0 1 1 
line D8 EP field 0 0 offset 
caption word 0 D8 EP DO07 D06 D05 D04 DO03 D02 DO1 DOO 
caption word 1 D8 EP D17 D16 D15 D14 D13 D12 D11 D10 
crests | TR Sum of) of a Phra apton word 


Notes: 


EP = even parity for DO-D7. 


Table 6.12. EIA-608 Closed Captioning Ancillary Data Packet Format. 


idh (internal error detected here): A “1” 
indicates that an error unrelated to the 


transmission has been detected. 


ida (internal error status): A “1” indicates 
data was received from a device that does 
not support this error detection method. 


Video Index Format 


If the video index (SMPTE RP-186) is present, 
it is carried on the CbCr data channels in the 
active portion of lines 14 and 277. 

A total of 90 8-bit data words are trans- 
ferred serially by D2 of the 720 CbCr samples 
of the active portion of the lines. A 10-bit value 
of 200 represents a “0;” a 10-bit value of 2044 
represents a “1.” Unused CbCr samples have a 
10-bit value of 200y. 
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Preset to all zeros; carry is ignored. 


8-bit Data 10-bit Data 
pe D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 (0) 0 0 0 0 0 0 0 
ancillary data 
flag (ADF) 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
data ID (DID) D8 EP 0 1 1 0 0 0 0 1 
SDID D8 EP 0 0 0 0 0 0 0 1 
data count (DC) D8 EP Value of 0000 0000 to 1111 1111 
data word O D8 EP Value of 0000 0000 to 1111 1111 
data word N D8 EP Value of 0000 0000 to 1111 1111 
wheen Sait De Sum of D0-D8 of data ID through last data word. 


Notes: 


EP = even parity for DO-D7. 


Table 6.13. EIA-708 Digital Closed Captioning Ancillary Data Packet Format. 
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8-bit Data 10-bit Data 
oS D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 0 0 0 0 0 0 0 0 
neapn 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
data ID (DID) Ds EP 1 1 1 1 0 1 0 0 
SDID D8 EP 0 0 0 0 0 0 0 0 
data count (DC) D8 EP 0 (0) 0 1 0 0 0 0 
D8 EP erc5 erc4 crc3 crc2 crcl crc0 0 0 
active CRC D8 EP ercll1 crcl0 cred crc8 crce7 crc6 0 0 
D8 EP Vv 0 ercl5 ercl4 ercl3 ercl2 0 0 
D8 EP ered crc4 crc3 crc2 crcl crc0 0 0 
field CRC D8 EP crell | crcl0 crc9 crc8 crc7 crc6 0 0 
D8 EP Vv 0 erel5 ercl4 ercl3 crcl2 0 0 
ancillary flags D8 EP 0 ues ida idh eda edh 0 0 
active flags D8 EP 0 ues ida idh eda edh 0 0 
field flags D8 EP 0 ues ida idh eda edh 0 0 
1 0 0 0 0 0 0 0 0 0 
1 0 0 0 0 0 0 0 0 0 
1 0 0 0 0 0 0 0 0 0 
reserved 1 0 0 (0) 0 0) 0 0 0 0 
1 0 0 0 0 0 0 0 0 0 
1 0 0 0 0 0 0 0 0 0 
1 0 0 0 0 0 0 0 0 0 
=5 Sum of D0-D8 of data ID through last reserved word. 
check sum BS Preset to all zeros; carry is ignored. 
Notes: 


EP = even parity for DO-D7. 


Table 6.14. Error Detection Ancillary Data Packet Format. 
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Pin Signal Pin Signal 
1 clock 14 clock- 
2 system ground A 15 system ground B 
3 D9 16 D9- 
4 D8 17 D8- 
5 D7 18 D7- 
6 D6 19 D6- 
7 D5 20 D5- 
8 D4 21 D4- 
9 D3 22 D3- 

10 D2 23 D2- 

11 D1 24 D1- 

12 DO 25 DO- 

13 cable shield 


Table 6.15. 25-Pin Parallel Interface Connector Pin 
Assignments. For 8-bit interfaces, D9-D2 are used. 


25-pin Parallel Interface 


This interface is used to transfer SDTV resolu- 
tion 4:2:2 YCbCr data. 8-bit or 10-bit data and a 
clock are transferred. The individual bits are 
labeled DO-D9, with D9 being the most signifi- 
cant bit. The pin allocations for the signals are 
shown in Table 6.15. 

Y has a nominal 10-bit range of 040y- 
3ACy. Values less than 040y or greater than 
3ACy may be present due to processing. Dur- 
ing blanking, Y data should have a value of 
040y, unless other information is present. 

Cb and Cr have a nominal 10-bit range of 
040y-3C0y. Values less than 040y or greater 
than 3C0y may be present due to processing. 
During blanking, CbCr data should have a 
value of 200y, unless other information is 
present. 

Signal levels are compatible with ECL- 
compatible balanced drivers and receivers. 


The generator must have a balanced output 
with a maximum source impedance of 110 Q; 
the signal must be 0.8-2.0V peak-to-peak mea- 
sured across a 110-Q load. At the receiver, the 
transmission line must be terminated by 110 
+10Q. 


27 MHz Parallel Interface 

This BT.656 and SMPTE 125M interface is 
used for interlaced SDTV systems with an 
aspect ratio of 4:3. Y and multiplexed CbCr 
information at a sample rate of 13.5 MHz are 
multiplexed into a single 8-bit or 10-bit data 
stream, at a clock rate of 27 MHz. 

The 27 MHz clock signal has a clock pulse 
width of 18.5 +3 ns. The positive transition of 
the clock signal occurs midway between data 
transitions with a tolerance of +3 ns (as shown 
in Figure 6.2). 
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CLOCK -f “2 


TD TC 


DATA ne 


TW = 18.5 + 3 NS 
TC = 37 NS 
TD = 18.5 + 3 NS 


Tw 


Figure 6.2. 25-Pin 27 MHz Parallel Interface Waveforms. 


RELATIVE GAIN (DB) 


20 


0.1 1 10 100 


FREQUENCY (MHZ) 


Figure 6.3. Example Line Receiver Equalization 
Characteristics for Small Signals. 
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To permit reliable operation at intercon- 
nect lengths of 50-200 meters, the receiver 
must use frequency equalization, with typical 
characteristics shown in Figure 6.3. This 
example enables operation with a range of 
cable lengths down to zero. 


36 MHz Parallel Interface 

This BT-1302 and SMPTE 267M interface is 
used for interlaced SDTV systems with an 
aspect ratio of 16:9. Y and multiplexed CbCr 
information at a sample rate of 18 MHz are 
multiplexed into a single 8-bit or 10-bit data 
stream, ata clock rate of 36 MHz. 

The 36 MHz clock signal has a clock pulse 
width of 13.9 +2 ns. The positive transition of 
the clock signal occurs midway between data 
transitions with a tolerance of +2 ns (as shown 
in Figure 6.4. 

To permit reliable operation at intercon- 
nect lengths of 40-160 meters, the receiver 
must use frequency equalization, with typical 
characteristics shown in Figure 6.3. 


CLOCK an, 


93-pin Parallel Interface 


This interface is used to transfer 16:9 HDTV 
resolution R’G’B’ data, 4:2:2 YCbCr data, or 
4:2:2:4 YCbCrK data. The pin allocations for 
the signals are shown in Table 6.16. The most 
significant bits are R9, G9, and BY. 

When transferring 4:2:2 YCbCr data, the 
green channel carries Y information and the 
red channel carries multiplexed CbCr informa- 
tion. 

When transferring 4:2:2:4 YCbCrK data, 
the green channel carries Y information, the 
red channel carries multiplexed CbCr informa- 
tion, and the blue channel carries K (alpha key- 
ing) information. 

Y has a nominal 10-bit range of 040y- 
3ACy. Values less than 040, or greater than 
3ACy may be present due to processing. Dur- 
ing blanking, Y data should have a value of 
0404, unless other information is present. 

Cb and Cr have a nominal 10-bit range of 
040y-3C0y. Values less than 040y or greater 


TD 
DATA | x 


TW = 13.9 + 2 NS 
TC = 27.8 NS 
TD = 13.9 + 2 NS 


Figure 6.4. 25-Pin 36 MHz Parallel Interface Waveforms. 
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Pin Signal Pin Signal Pin Signal Pin Signal 
1 clock 26 GND 51 B2 76 GND 
2 G9 27 GND 52 Bl 77 GND 
3 G8 28 GND 53 BO 78 GND 
4 G7 29 GND 54 RO 79 B4- 
5 G6 30 GND 55 R8 80 B3- 
6 G5 31 GND 56 R7 81 B2- 
7 G4 32 GND 57 R6 82 Bl- 
8 G3 33 clock- 58 RS 83 BO- 
9 G2 34 G9- 59 R4 84 R9- 

10 Gl 35 G8- 60 R3 85 R8- 

11 GO 36 G7- 61 R2 86 R7- 

12 B9 37 G6- 62 R1 87 R6- 

13 B8 38 G5- 63 RO 88 R5- 

14 B7 39 G4- 64 GND 89 R4- 

15 B6 40 G3- 65 GND 90 R3- 

16 BS Al G2- 66 GND 91 R2- 

17 GND 42 Gl- 67 GND 92 R1- 

18 GND 43 GO0- 68 GND 93 RO- 

19 GND 44 B9- 69 GND 

20 GND 45 B8- 70 GND 

21 GND 46 B7- 71 GND 

22 GND 47 B6- 72 GND 

23 GND 48 B5- 73 GND 

24 GND 49 B4 74 GND 

25 GND 50 B3 75 GND 


Table 6.16. 93-Pin Parallel Interface Connector Pin 
Assignments. For 8-bit interfaces, bits 9-2 are used. 
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CLOCK 


TD 
|<] 
DATA Y 


TW = 6.73 + 1.48 NS 
TC = 13.47 NS 
TD = 6.73 + 1 NS 


Figure 6.5. 93-Pin 74.25 MHz Parallel Interface Waveforms. 


than 3C0y may be present due to processing. 
During blanking, CbCr data should have a 
value of 200y, unless other information is 
present. 

R’G’B' and K have a nominal 10-bit range 
of 040y-3ACy. Values less than 040, or 
greater than 3ACy may be present due to pro- 
cessing. During blanking, R’G’B’ data should 
have a value of 040y, unless other information 
is present. 

Signal levels are compatible with ECL- 
compatible balanced drivers and receivers. 
The generator must have a balanced output 
with a maximum source impedance of 110 Q; 
the signal must be 0.6-2.0V peak-to-peak mea- 
sured across a 110-Q load. At the receiver, the 
transmission line must be terminated by 110 
+10Q. 


74.25 MHz Parallel Interface 
This ITU-R BT.1120 and SMPTE 274M inter- 
face is primarily used for 16:9 HDTV systems. 


The 74.25 MHz clock signal has a clock 
pulse width of 6.73 +1.48 ns. The positive tran- 
sition of the clock signal occurs midway 
between data transitions with a tolerance of +1 
ns (as shown in Figure 6.5). 

To permit reliable operation at intercon- 
nect lengths greater than 20 meters, the 
receiver must use frequency equalization. 


74.176 MHz Parallel Interface 
This BT.1120 and SMPTE 274M interface is 
primarily used for 16:9 HDTV systems. 

The 74.176 MHz (74.25/1.001) clock signal 
has a clock pulse width of 6.74 +1.48 ns. The 
positive transition of the clock signal occurs 
midway between data transitions with a toler- 
ance of +1 ns (similar to Figure 6.5). 

To permit reliable operation at intercon- 
nect lengths greater than 20 meters, the 
receiver must use frequency equalization. 
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148.5 MHz Parallel Interface 
This BT:1120 and SMPTE 274M interface is 
used for 16:9 HDTV systems. 

The 148.5 MHz clock signal has a clock 
pulse width of 3.37 +0.74 ns. The positive tran- 
sition of the clock signal occurs midway 
between data transitions with a tolerance of 
+0.5 ns (similar to Figure 6.5). 

To permit reliable operation at intercon- 
nect lengths greater than 14 meters, the 
receiver must use frequency equalization. 


148.35 MHz Parallel Interface 
This BT.1120 and SMPTE 274M interface is 
used for 16:9 HDTV systems. 

The 148.35 MHz (148.5/1.001) clock signal 
has a clock pulse width of 3.37 +0.74 ns. The 
positive transition of the clock signal occurs 
midway between data transitions with a toler- 
ance of +0.5 ns (similar to Figure 6.5). 

To permit reliable operation at intercon- 
nect lengths greater than 14 meters, the 
receiver must use frequency equalization. 


Serial Interface 


The parallel formats can be converted to a 
serial format (Figure 6.6), allowing data to be 
transmitted using a 75-Q coaxial cable (or opti- 
cal fiber). Equipment inputs and outputs both 
use BNC connectors so that interconnect 
cables can be used in either direction. 

For cable interconnect, the generator has 
an unbalanced output with a source impedance 
of 75Q; the signal must be 0.8V +10% peak-to- 
peak measured across a 75-Q load. The 
receiver has an input impedance of 75Q.. 

In an 8bit environment, before serializa- 
tion, the 00yq and FFy codes during EAV and 
SAV are expanded to 10-bit values of 000y and 


3FFy, respectively. All other 8-bit data is 
appended with two least significant “0” bits 
before serialization. 

The 10 bits of data are serialized (LSB 
first) and processed using a scrambled and 
polarity-free NRZI algorithm: 


G(x) = (x9 +x44+1) (41) 


The input signal to the scrambler (Figure 
6.7) uses positive logic (the highest voltage 
represents a logical one; lowest voltage repre- 
sents a logical zero). 

The formatted serial data is output at the 
10x sample clock rate. Since the parallel clock 
may contain large amounts of jitter, deriving 
the 10x sample clock directly from an unfil- 
tered parallel clock may result in excessive sig- 
nal jitter. 

At the receiver, phase-lock synchronization 
is done by detecting the EAV and SAV 
sequences. The PLL is continuously adjusted 
slightly each scan line to ensure that these pat- 
terns are detected and to avoid bit slippage. 
The recovered 10x sample clock is divided by 
ten to generate the sample clock, although 
care must be taken not to mask word-related 
jitter components. The serial data is low- and 
high-frequency equalized, inverse scrambling 
performed (Figure 6.8), and deserialized. 


270 Mbps Serial Interface 

This BT.656 and SMPTE 259M interface (also 
called SDI) converts a 27 MHz parallel stream 
into a 270 Mbps serial stream. The 10x PLL 
generates a 270 MHz clock from the 27 MHz 
clock signal. This interface is primarily used 
for 4:3 interlaced SDTV systems. 
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PARALLEL 10 10 PARALLEL 
SHIFT SHIFT 
4:2:2 —j4— REGISTER [7] SCRAMBLER DESCRAMBLER }—*| praicter 4:2:2 
VIDEO VIDEO 
SAMPLE 
CLOCK 
SAV, EAV 
DETECT 
SERIAL 
' 
DIVIDE SAMPLE 
|—®——+| 
ao BY 10 CLOCK 
Figure 6.6. Serial Interface Block Diagram. 
SERIAL ENCODED 
BATA DQ DQ DQ DQ DQ D DQ DQ DQ DQ ee 
IN =| | | je =| + =| OUT 
(NRZ) (NRZI) 
G(X) = X9 + X44 G(X) =X +1 
Figure 6.7. Typical Scrambler Circuit. 
ENCODED SERIAL 
DATA DATA 
IN TP 8 OUT 
(NRZI) (NRZ) 


Figure 6.8. Typical Descrambler Circuit. 


7 HBBBE BEBE 
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360 Mbps Serial Interface 

This BT.1302 interface converts a 36 MHz par- 
allel stream into a 360 Mbps serial stream. The 
10x PLL generates a 360 MHz clock from the 
36 MHz clock signal. This interface is prima- 
rily used for 16:9 interlaced SDTV systems. 


540 Mbps Serial Interface 

This SMPTE 344M interface converts a 54 
MHz parallel stream, or two 27 MHz parallel 
streams, into a 540 Mbps serial stream. The 
10x PLL generates a 540 MHz clock from the 
54 MHz clock signal. This interface is prima- 
rily used for 4:3 progressive SDTV systems. 


1.485 Gbps Serial Interface 
This BT.1120 and SMPTE 292M interface mul- 
tiplexes two 74.25 MHz parallel streams (Y and 
ChbCr) into a single 1.485 Gbps serial stream. A 
20x PLL generates a 1.485 GHz clock from the 
74.25 MHz clock signal. This interface is used 
for 16:9 HDTV systems. 

Before multiplexing the two parallel 
streams together, line number and CRC infor- 


mation (Table 6.17) is added to each stream 
after each EAV sequence. The CRC is used to 
detect errors in the active video and EAV. It 
consists of two words generated by the polyno- 
mial: 


CRC =x}8 4x54 x441 


The initial value is set to zero. The calculation 
starts with the first active line word and ends at 
the last word of the line number (LN1). 


1.4835 Gbps Serial Interface 
This BT.1120 and SMPTE 292M interface mul- 
tiplexes two 74.176 (74.25/1.001) MHz parallel 
streams (Y and CbCr) into a single 1.4835 
(1.485/1.001) Gbps serial stream. A 20x PLL 
generates a 1.4835 GHz clock from the 74.176 
MHz clock signal. This interface is used for 
16:9 HDTV systems. 

Line number and CRC information is 
added as described for the 1.485 Gbps serial 
interface. 


a aan D8 D7 D6 D5 D4 D3 D2 D1 DO 
LNO D8 L6 L5 14 13 12 Ll LO 0 0 
IN1 D8 0 0 0 110 L9 18 17 0 0 
CRCO D8 crc8 crce7 crc6 cred crc4 cerc3 cerce2 ercl crc0 
CRC1 D8 ercl7 crcl6 erel5 ercl4 ercl3 ercel2 erell crcl0 crc9 


Table 6.17. Line Number and CRC Data. 
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SDTV—Interlaced 


Supported active resolutions, with their corre- 
sponding aspect ratios and frame refresh rates, 
are: 


720x480 9 4:3 29.97 Hz 
720x576 4:3 25.00 Hz 
960x480 16:9 29.97 Hz 
960x576 16:9 25.00 Hz 


4:2:2 YCbCr Parallel Interface 

The ITU-R BT.656 and BT.1302 parallel inter- 
faces were developed to transfer BT:601 4:2:2 
YCbCr digital video between equipment. 
SMPTE 125M and 267M further clarify the 
operation for 525-line systems. 

Figure 6.9 illustrates the timing for one 
scan line for the 4:3 aspect ratio, using a 27 
MHz sample clock. Figure 6.10 shows the tim- 
ing for one scan line for the 16:9 aspect ratio, 
using a 36 MHz sample clock. The 25-pin paral- 
lel interface is used. 


4:2:2 YCbCr Serial Interface 

BT.656 and BT:1302 also define a YCbCr serial 
interface. The 10-bit 4:2:2 YCbCr parallel 
streams shown in Figure 6.9 or 6.10 are serial- 


ized using the 270 or 360 Mbps serial interface. 


4:4:4:4 YCbCrK Parallel Interface 

The ITU-R BT-799 and BT.1303 parallel inter- 
faces were developed to transfer BT.601 4:4:4:4 
YCbCrK digital video between equipment. K is 
an alpha keying signal, used to mix two video 
sources, discussed in Chapter 7. SMPTE RP- 
175 further clarifies the operation for 525-line 
systems. 


Multiplexing Structure 

Two transmission links are used. Link A con- 
tains all the Y samples plus those Cb and Cr 
samples located at even-numbered sample 
points. Link B contains samples from the key- 
ing channel and the Cb and Cr samples from 
the odd-numbered sampled points. Although it 
may be common to refer to Link A as 4:2:2 and 
Link B as 2:2:4, Link A is not a true 4:2:2 signal 
since the CbCr data was sampled at 13.5 MHz, 
rather than 6.75 MHz. 

Figure 6.11 shows the contents of links A 
and B when transmitting 4:4:4:4 YCbCrK video 
data. Figure 6.12 illustrates the contents when 
transmitting R’G’B’K video data. If the keying 
signal (K) is not present, the K sample values 
should have a 10-bit value of 3ACy. 

Figure 6.13 illustrates the YCbCrK timing 
for one scan line for the 4:3 aspect ratio, using 
a 27 MHz sample clock. Figure 6.14 shows the 
YCbCrK timing for one scan line for the 16:9 
aspect ratio, using a 36 MHz sample clock. 
Two 25-pin parallel interfaces are used. 


4:4:4:4 YCbCrK Serial Interface 

BT:799 and BT.1303 also define a YCbCr serial 
interface. The two 10-bit 4:2:2 YCbCr parallel 
streams shown in Figure 6.13 or 6.14 are serial- 
ized using two 270 or 360 Mbps serial inter- 
faces. SMPTE RP-175 further clarifies the 
operation for 525-line systems. 


RGBK Parallel Interface 

BT.799 and BT:1303 also support transferring 
BT:601 R’G'BK digital video between equip- 
ment. For additional information, see the 
4:4:4:4 YCbCrK parallel interface. SMPTE RP- 
175 further clarifies the operation for 525-line 
systems. The G’ samples are sent in the Y loca- 
tions, the R’ samples are sent in the Cr loca- 
tions, and the B’ samples are sent in the Cb 
locations. 
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y BT.601 H SIGNAL 


START OF DIGITAL LINE START OF DIGITAL ACTIVE LINE 


NEXT LINE 
EAV CODE BLANKING SAV CODE CO-SITED CO-SITED 
xP cly|[cly]c]y]c]y (al aa Fs BT.656 
Blo] R|1/B] 2] RI] 3 R | 719 42:2 
Z]o0 ty) 2 2 718 F 
VIDEO 
1440 
_ ~|~ na 
1716 (1728) 
Be 


Figure 6.9. BT.656 and SMPTE 125M Parallel Interface Data For One Scan Line. 525- 


line; 4:2:2 YCbCr; 720 active samples per line; 27 MHz clock; 10-bit system. The 
values for 625-line systems are shown in parentheses. 


BT.601 H SIGNAL 


START OF DIGITAL LINE 


START OF DIGITAL ACTIVE LINE NEXT LINE 


EAV CODE 


BLANKING SAV CODE CO-SITED CO-SITED 


BT.1302 
4:2:2 
VIDEO 
4 360 (376) 
~ =| =| 
2288 (2304) 
~ 


Figure 6.10. BT.1302 and SMPTE 267M Parallel Interface Data For One Scan Line. 


525-line; 4:2:2 YCbCr; 960 active samples per line; 36 MHz clock; 10-bit system. The 
values for 625-line systems are shown in parentheses. 
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SAMPLE NUMBER 


0 1 2 3 4 5 6 7 


Y Y Y Y Y Y Y Y 


LINK A CB | CB | CB | CB |} CB | CB |} CB | CB 


CR | CR | CR | CR} CR | CR | CR | CR 


LINK B K K K K K K K K 


Figure 6.11. Link Content Representation for 
YCbCrK Video Signals. 


SAMPLE NUMBER 


(0) 1 2 3 4 6 7 
G G G G G G G 
LINK A B B B B B B B 
R R R R R R R 
LINK B K K K K K K K 


Figure 6.12. Link Content Representation 


for R'G°B’K Video Signals. 


START OF DIGITAL LINE 


EAV CODE 


BLANKING 


SAV CODE 


4 268 (280) 


BT.601 H SIGNAL 


START OF DIGITAL ACTIVE LINE 


CO-SITED CO-SITED 


Cc} y}|c;}Y}]C;Y] cy; Y Cc Vi 3 
By} oOo; R]| 1] By] 2] RY] 3 R 719 F 
0 0 2 2 718 F 


1440 


1716 (1728) 


BLANKING SAV CODE 


NEXT LINE 


4:2:2 
STREAM 
(LINK A) 


4:2:2 
STREAM 
(LINK B) 


Figure 6.13. BT.799 and SMPTE RP-175 Parallel Interface Data For One Scan Line. 525-line; 


4:4:4:4 YCbCrK; 720 active samples per line; 27 MHz clock; 10-bit system. The values for 625-line 


systems are shown in parentheses. 
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START OF DIGITAL LINE 


EAV CODE 


BLANKING SAV CODE 


BT.601 H SIGNAL 


START OF DIGITAL ACTIVE LINE 


NEXT LINE 


CO-SITED 


CO-SITED 


4:2:2 
STREAM 


(LINK A) 
2288 (2304) 
EAV CODE BLANKING SAV CODE 
3 x} 2] 0] 2] 0 210] 3 xf cl k|c]|K]c}]K]c]k c | k [3 4:2:2 
F Y}o] 4] 0] 4 o| 4, F Y}B]o] RA] 1/B] 2] R] 3 R | 959 JF 
F Z]o] 0] 0] 0 o| of F Zz] 1 1 3 3 959 F STREAM 
(LINK B) 


Figure 6.14. BT.1303 Parallel Interface Data For One Scan Line. 525-line; 4:4:4:4 YCbCrK; 960 
active samples per line; 36 MHz clock; 10-bit system. The values for 625-line systems are shown 


in parentheses. 


RGBK Serial Interface 

BT.799 and BT.1303 also define a R’G’B’K 
serial interface. The two 10-bit R’G’B’K paral- 
lel streams are serialized using two 270 or 360 
Mbps serial interfaces. 


SDTV—Progressive 


Supported active resolutions, with their corre- 
sponding aspect ratios and frame refresh rates, 
are: 


720x480 = 4:3 
720x576 4:3 


59.94 Hz 
50.00 Hz 


4:2:2 YCbCr Serial Interface 

ITU-R BT.1362 defines two 10-bit 4:2:2 YCbCr 
data streams (Figure 6.15), using a 27 MHz 
sample clock. SMPTE 294M further clarifies 
the operation for 525-line systems. 

What stream is used for which scan line is 
shown in Table 6.18. The two 10-bit parallel 
streams shown in Figure 6.15 are serialized 
using two 270 Mbps serial interfaces. 
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BT.1358, SMPTE 293M H SIGNAL 


START OF DIGITAL LINE START OF DIGITAL ACTIVE LINE NEXT LINE 


EAV CODE BLANKING SAV CODE CO-SITED CO-SITED 
BT.656 4:2:2 
STREAM 
(LINK A) 

268 (280) 
1716 (1728) 

EAV CODE BLANKING SAV CODE 

3 x} 2] 0] 2] 0 210] 3 xP cly|/cly|]c}]y]/cly c | Yy [3 BT.656 4:2:2 

F Y}o|] 4] 0] 4 o| 4, F Y}B] oo] R/1/B] 2] R] 3 R | 719 JF STREAM 

F Z}o] 0] 0] 0 o| of F Zo 0 2 2 718 F 
(LINK B) 


Figure 6.15. BT.1362 and SMPTE 294M Parallel Data For Two Scan Lines. 525-line; 4:2:2 YCbCr; 
720 active samples per line; 27 MHz clock; 10-bit system. The values for 625-line systems are 
shown in parentheses. 


525-Line System 625-Line System 
Link Link Link Link Link Link Link Link 
A B A B A B A B 
7 8 6 7 1 2 4 5 
9 10 : : 3 4 6 7 
522 523 : : 8 9 
523 524 524 525 621 622 
525 1 1 2 623 624 620 621 
2 3 3 4 625 1 622 623 
4 5 5 6 2 3 624 625 


Table 6.18. BT.1362 and SMPTE 294M Scan Line Numbering 
and Link Assignment. 
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HDTV—interlaced 


Supported active resolutions, with their corre- 
sponding aspect ratios and frame refresh rates, 
are: 


1920x1080 16:9 25.00 Hz 
1920x1080 16:9 29.97 Hz 
1920x1080 16:9 30.00 Hz 


4:2:2 YCbCr Parallel Interface 

The ITU-R BT1120 parallel interface was 
developed to transfer interlaced HDTV 4:2:2 
YCbCr digital video between equipment. 
SMPTE 274M further clarifies the operation 
for 29.97 and 30 Hz systems. 

Figure 6.16 illustrates the timing for one 
scan line for the 1920 x 1080 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 74.25 MHz (25 or 30 Hz 
refresh) or 74.176 MHz (29.97 Hz refresh). 


START OF DIGITAL LINE 


EAV CODE BLANKING SAV CODE 


START OF DIGITAL ACTIVE LINE 


YI Y;]Y}Y] YY; YY] Y 
Oo; 1/2) 3); 4] 5] 6] 7 
Z 
|e 


4:2:2 YCbCr Serial Interface 

BT.1120 also defines a YCbCr serial interface. 
SMPTE 292M further clarifies the operation 
for 29.97 and 30 Hz systems. The two 10-bit 
4:2:2 YCbCr parallel streams shown in Figure 
6.16 are multiplexed together, then serialized 
using a 1.485 or 1.4835 Gbps serial interface. 


4:2:2:4 YCbCrK Parallel Interface 
BT.1120 also supports transferring HDTV 
4:2:2:4 YCbCrK digital video between equip- 
ment. SMPTE 274M further clarifies the oper- 
ation for 29.97 and 30 Hz systems. 

Figure 6.17 illustrates the timing for one 
scan line for the 1920 x 1080 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 74.25 MHz (25 or 30 Hz 
refresh) or 74.176 MHz (29.97 Hz refresh). 


BT.709, SMPTE 274M H SIGNAL 


NEXT LINE 


v Y 3 
1918 | 1919 J F 
F 


Y 
CHANNEL 


2200 (2640) 


EAV CODE BLANKING SAV CODE 


3 Xf 2] 2] 2] 2 2/273 X}C];]C}|c}]cCc;} Cc} cy; c;c 
F Y{ oO; 0} 0; O Oo}; OF F Y{TB; R} By] RR} B}] RR} BI] R 
E ZiT 0; 0} 0; 0 Oo} OF F ZT 0; 0; 2; 2) 4] 4] 6] 6 


CBCR 
CHANNEL 


Cc Cc 3 
B R F 
1918 | 1918 | F 


Figure 6.16. BT.1120 and SMPTE 274M Parallel Interface Data For One Scan Line. 
1125-line; 29.97-, 30-, 59.94-, and 60-Hz systems; 4:2:2 YCbCr; 1920 active samples 
per line; 74.176, 74.25, 148.35, or 148.5 MHz clock; 10-bit system. The values for 25- 
and 50-Hz systems are shown in parentheses. 


122 ~= Chapter 6: Digital Video Interfaces 


BT.709, SMPTE 274M H SIGNAL 


START OF DIGITAL LINE START OF DIGITAL ACTIVE LINE NEXT LINE 


EAV CODE BLANKING SAV CODE 


4 


CHANNEL 
272 (712) 
2200 (2640) 
- 
EAV CODE BLANKING SAV CODE 
CBCR 

CHANNEL 

EAV CODE BLANKING SAV CODE 

K 

CHANNEL 


Figure 6.17. BT.1120 and SMPTE 274M Parallel Interface Data For One Scan Line. 
1125-line; 29.97-, 30-, 59.94-, and 60-Hz systems; 4:2:2:4 YCbCrK; 1920 active 
samples per line; 74.176, 74.25, 148.35, or 148.5 MHz clock; 10-bit system. The 
values for 25- and 50-Hz systems are shown in parentheses. 
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RGB Parallel Interface 

BT.1120 also supports transferring HDTV 
R’G’B’ digital video between equipment. 
SMPTE 274M further clarifies the operation 
for 29.97 and 30 Hz systems. 


Figure 6.18 illustrates the timing for one 
scan line for the 1920 x 1080 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 74.25 MHz (25 or 30 Hz 
refresh) or 74.176 MHz (29.97 Hz refresh). 


BT.709, SMPTE 274M H SIGNAL 


START OF DIGITAL LINE START OF DIGITAL ACTIVE LINE NEXT LINE 


G G 3 
1918 | 1919 J F 
F 


EAV CODE BLANKING SAV CODE 


GREEN 


CHANNEL 
2200 (2640) 
— 
EAV CODE BLANKING SAV CODE 
3 xf o]o]o] 0 o| of 3 % R| RA] R| RR] R R R R [3 RED 
F Y} 4] 4] 4] 4 4/4] F Y 1/2] 3] 4] 5 7 1918 | 1919 | F 
F Z]o] 0] 0] 0 o| of F Zz F CHANNEL 
EAV CODE BLANKING SAV CODE 
3 xf ol o]o]o o| of 3 % B/ B| B|/ BI B B B B [3 ALE 
F Y}4]4] 4] 4 4/4] F Y 1/2] 3] 4] 5 7 1918 | 1919 | F 
F Z}o]o] 0] 0 o| of F Z F CHANNEL 


Figure 6.18. BT.1120 and SMPTE 274M Parallel Interface Data For One Scan Line. 
1125-line; 29.97-, 30-, 59.94-, and 60-Hz systems; R°G’B’; 1920 active samples per 
line; 74.176, 74.25, 148.35, or 148.5 MHz clock; 10-bit system. The values for 25- 
and 50-Hz systems are shown in parentheses. 
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HDTV—Progressive 


Supported active resolutions, with their corre- 
sponding aspect ratios and frame refresh rates, 
are: 


1280x720 16:9 23.98 Hz 
1280x720 16:9 24.00 Hz 
1280x720 16:9 25.00 Hz 
1280x720 16:9 29.97 Hz 
1280x720 16:9 30.00 Hz 
1280x720 16:9 50.00 Hz 
1280x720 16:9 59.94 Hz 
1280x720 16:9 60.00 Hz 
1920x1080 16:9 23.98 Hz 
1920 x 1080 16:9 24.00 Hz 
1920 x 1080 16:9 25.00 Hz 
1920x1080 16:9 29.97 Hz 
1920 x 1080 16:9 30.00 Hz 
1920 x 1080 16:9 50.00 Hz 
1920x1080 16:9 59.94 Hz 
1920 x 1080 16:9 60.00 Hz 


4:2:2 YCbCr Parallel Interface 

The ITU-R BT:1120 and SMPTE 274M parallel 
interfaces were developed to transfer progres- 
sive HDTV 4:2:2 YCbCr digital video between 
equipment. 

Figure 6.16 illustrates the timing for one 
scan line for the 1920 x 1080 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 148.5 MHz (24, 25, 30, 50 
or 60 Hz refresh) or 148.35 MHz (23.98, 29.97 
or 59.94 Hz refresh). 

Figure 6.19 illustrates the timing for one 
scan line for the 1280 x 720 active resolutions. 
The 93-pin parallel interface is used with a 


sample clock rate of 74.25 MHz (24, 25, 30, 50 
or 60 Hz refresh) or 74.176 MHz (23.98, 29.97, 
or 59.94 Hz refresh). 


4:2:2:4 YCbCrK Parallel Interface 
BT.1120 and SMPTE 274M also support trans- 
ferring HDTV 4:2:2:4 YCbCrK digital video 
between equipment. 

Figure 6.17 illustrates the timing for one 
scan line for the 1920 x 1080 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 148.5 MHz (24, 25, 30, 50 
or 60 Hz refresh) or 148.35 MHz (23.98, 29.97 
or 59.94 Hz refresh). 

Figure 6.20 illustrates the timing for one 
scan line for the 1280 x 720 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 74.25 MHz (24, 25, 30, 50 
or 60 Hz refresh) or 74.176 MHz (23.98, 29.97 
or 59.94 Hz refresh). 


RGB Parallel Interface 

BT:1120 and SMPTE 274M also support trans- 
ferring HDTV R’G’B’ digital video between 
equipment. 

Figure 6.18 illustrates the timing for one 
scan line for the 1920 x 1080 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 148.5 MHz (24, 25, 30, 50 
or 60 Hz refresh) or 148.35 MHz (23.98, 29.97 
or 59.94 Hz refresh). 

Figure 6.21 illustrates the timing for one 
scan line for the 1280 x 720 active resolutions. 
The 93-pin parallel interface is used with a 
sample clock rate of 74.25 MHz (24, 25, 30, 50 
or 60 Hz refresh) or 74.176 MHz (23.98, 29.97 
or 59.94 Hz refresh). 
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START OF DIGITAL LINE 


SMPTE 296M H SIGNAL 


START OF DIGITAL ACTIVE LINE NEXT LINE 


EAV CODE BLANKING SAV CODE 
yl ylyly/lyly]yly Y y [3 v 
o/1]/2/ 3/4] 5] 6] 7 1278 | 1279 | F 
F CHANNEL 
4 362 (692) 
\. _| 
1650 (1980 
EAV CODE BLANKING SAV CODE 
CBCR 
CHANNEL 


Figure 6.19. SMPTE 274M Parallel Interface Data For One Scan Line. 750-line; 59.94- 
and 60-Hz systems; 4:2:2 YCbCr; 1280 active samples per line; 74.176 or 74.25 MHz 
clock; 10-bit system. The values for 50-Hz systems are shown in parentheses. 


SMPTE 296M H SIGNAL 
START OF DIGITAL LINE START OF DIGITAL ACTIVE LINE NEXT LINE 
EAV CODE BLANKING SAV CODE 
Y 
CHANNEL 
362 (692) 
1650 (1980 
EAV CODE BLANKING SAV CODE 
CBCR 
CHANNEL 
EAV CODE BLANKING SAV CODE 
K 
CHANNEL 


Figure 6.20. SMPTE 274M Parallel Interface Data For One Scan Line. 750-line; 59.94- 
and 60-Hz systems; 4:2:2:4 YCbCrK; 1280 active samples per line; 74.176 or 74.25 
MHz clock; 10-bit system. The values for 50-Hz systems are shown in parentheses. 
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START OF DIGITAL LINE 


EAV CODE 


BLANKING 


362 (692) 


SAV CODE 


SMPTE 296M H SIGNAL 


START OF DIGITAL ACTIVE LINE 


G G 3 
1278 | 1279 | F 
F 


1650 (1980) 


BLANKING 


BLANKING 


SAV CODE 


SAV CODE 


NEXT LINE 


GREEN 
CHANNEL 


RED 
CHANNEL 


BLUE 
CHANNEL 


Figure 6.21. SMPTE 274M Parallel Interface Data For One Scan Line. 750-line; 59.94- 
and 60-Hz systems; R°G°B’; 1280 active samples per line; 74.176 or 74.25 MHz clock; 
10-bit system. The values for 50-Hz systems are shown in parentheses. 
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Pro-Video Composite 
Interfaces 


Digital composite video is essentially a digital 
version of a composite analog (M) NTSC or (B, 
D, G, H, I) PAL video signal. The sample clock 
rate is four times Fc: about 14.32 MHz for 
(M) NTSC and about 17.73 MHz for (B, D, G, 
H, I) PAL. 

Usually, both 8-bit and 10-bit interfaces are 
supported, with the 10-bit interface used to 
transmit 2 bits of fractional video data to mini- 
mize cumulative processing errors and to sup- 
port 10-bit ancillary data. 

Table 6.19 lists the digital composite levels. 
Video data may not use the 10-bit values of 
0004-003, and 3FCy-3FF y, or the 8-bit values 
of 00y and FFy, since they are used for timing 
information. 


NTSC Video Timing 


There are 910 total samples per scan line, as 
shown in Figure 6.22. Horizontal count 0 corre- 
sponds to the start of active video, and a hori- 
zontal count of 768 corresponds to the start of 
horizontal blanking. 

Sampling is along the +I and +Q axes (83°, 
123°, 213°, and 303°). The sampling phase at 
horizontal count 0 of line 7, Field 1 is on the +I 
axis (123°). 

The sync edge values, and the horizontal 
counts at which they occur, are defined as 
shown in Figure 6.23 and Tables 6.20-6.22. 8- 
bit values for one color burst cycle are 45, 83, 
75, and 37. The burst envelope starts at hori- 
zontal count 857, and lasts for 43 clock cycles, 
as shown in Table 6.20. Note that the peak 
amplitudes of the burst are not sampled. 


Video (M) (B, D, G, H, 1) 
Level NTSC PAL 
peak chroma 972 rereaees 1023) 
white 800 844 
peak burst 352 380 
black 280 256 
blank 240 256 
peak burst 128 128 
peak chroma 104 128 
sync 16 4 


Table 6.19. 10-Bit Video Levels for Digital Composite Video Signals. 
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DIGITAL 
BLANKING DIGITAL ACTIVE LINE 


142 SAMPLES 768 SAMPLES 
(768-909) (0-767) 


TOTAL LINE 


910 SAMPLES 
(0-909) 


Figure 6.22. Digital Composite (M) NTSC Analog and Digital Timing Relationship. 


END OF END OF 
ANALOG DIGITAL 
LINE LINE 


768 (60) 
784 (41) 


50% 
785 (17) 
|-+— 


787 (4) 


Figure 6.23. Digital Composite (M) NTSC Sync Timing. The horizontal counts with the 
corresponding 8-bit sample values are in parentheses. 
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8-bit 10-bit 
Sample Hex Value Hex Value 
Fields 1, 3 Fields 2, 4 Fields 1, 3 Fields 2, 4 
768-782 3C 3C OFO OFO 
783 3A 3A OE9 OE9 
784 29 29 OA4 OA4 
785 11 11 044 044 
786 04 04 011 011 
787-849 04 04 010 010 
850 06 06 017 017 
851 17 17 05C 05C 
852 2F 2F OBC OBC 
853 3C 3C OEF OEF 
854-856 3C 3C OFO OFO 
857 3C 3C OFO OFO 
858 3D 3B OF4 OEC 
859 37 41 0DC 104 
860 36 42 0D6 10A 
861 4B 2D 12C OB4 
862 49 2F 123 OBD 
863 25 53 096 14A 
864 2D 4B 0B3 12D 
865 53 25 14E 092 
866 4B 2D 12D 0B3 
867 25 53 092 14E 
868 2D 4B 0B3 12D 
869 53 25 14E 092 
870 4B 2D 12D 0B3 
871 25 53 092 14E 
872 2D 4B 0B3 12D 
873 53 25 14E 092 


Table 6.20a. Digital Values During the Horizontal Blanking Intervals for Digital 


Composite (M) NTSC Video Signals. 
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8-bit 10-bit 
Sample Hex Value Hex Value 
Fields 1, 3 Fields 2, 4 Fields 1, 3 Fields 2, 4 
874 4B 2D 12D OB3 
875 25 53 092 14E 
876 2D 4B 0B3 12D 
877 53 25 14E 092 
878 4B 2D 12D OB3 
879 25 53 092 14E 
880 2D 4B 0B3 12D 
881 53 25 14E 092 
882 4B 2D 12D OB3 
883 25 53 092 14E 
884 2D 4B 0B3 12D 
885 53 25 14E 092 
886 4B 2D 12D OB3 
887 25 53 092 14E 
888 2D 4B 0B3 12D 
889 53 25 14E 092 
890 4B 2D 12D 0OB3 
891 25 53 092 14E 
892 2D 4B 0B3 12D 
893 53 25 14E 092 
894 4A 2E 129 OB7 
895 2A 4E OA6 13A 
896 33 45 0CD 113 
897 44 34 112 OCE 
898 3F 39 OFA OE6 
899 3B 3D OEC OF4 
900-909 3C 3C OFO OFO 


Table 6.20b. Digital Values During the Horizontal Blanking Intervals for Digital 
Composite (M) NTSC Video Signals. 
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Fields 1, 3 Fields 2, 4 
Sample 8-bit 10-bit Sample 8-bit 10-bit 
Hex Value Hex Value Hex Value Hex Value 
768-782 3C OFO 313-327 3C OFO 
783 3A 0OE9 328 3A 0OE9 
784 29 OA4 329 29 OA4 
785 11 044 330 11 044 
786 04 011 331 04 011 
787-815 04 010 332-360 04 010 
816 06 017 361 06 017 
817 17 05C 362 17 05C 
818 2F OBC 363 2F OBC 
819 3C OEF 364 3C OEF 
820-327 3C OFO 365-782 3C OFO 
328 3A OE9 783 3A 0OE9 
329 29 OA4 784 29 OA4 
330 11 044 785 al 044 
331 04 011 786 04 011 
332-360 04 010 787-815 04 010 
361 06 017 816 06 017 
362 17 05C 817 17 05C 
363 2F OBC 818 2F OBC 
364 3C OEF 819 3C OEF 
365-782 3C OFO 820-327 3C OFO 


Table 6.21. Equalizing Pulse Values During the Vertical Blanking Intervals for Digital 
Composite (M) NTSC Video Signals. 
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Fields 1, 3 Fields 2, 4 
Sample 8-bit 10-bit Sample 8-bit 10-bit 
Hex Value Hex Value Hex Value Hex Value 
782 3C OFO 327 3C OFO 
783 3A 0OE9 328 3A 0E9 
784 29 OA4 329 29 OA4 
785 11 044 330 11 044 
786 04 011 331 04 011 
787-260 04 010 332-715 04 010 
261 06 017 716 06 017 
262 17 05C M7 17 05C 
263 2F OBC 718 2F OBC 
264 3C OEF 719 3C OEF 
265-327 3C OFO 720-782 3C OFO 
328 3A 0E9 783 3A 0OE9 
329 29 OA4 784 29 OA4 
330 11 044 785 11 044 
331 04 011 786 04 011 
332-715 04 010 787-260 04 010 
716 06 017 261 06 017 
M7 17 05C 262 17 05C 
718 2F OBC 263 2F OBC 
719 3C OEF 264 3C OEF 
720-782 3C OFO 265-327 3C OFO 


Table 6.22. Serration Pulse Values During the Vertical Blanking Intervals for Digital 
Composite (M) NTSC Video Signals. 
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To maintain zero SCH phase, horizontal 
count 784 occurs 25.6 ns (33° of the subcarrier 
phase) before the 50% point of the falling edge 
of horizontal sync, and horizontal count 785 
occurs 44.2 ns (57° of the subcarrier phase) 
after the 50% point of the falling edge of hori- 
zontal sync. 


PAL Video Timing 


There are 1135 total samples per line, except 
for lines 313 and 625 which have 1137 samples 
per line, making a total of 709,379 samples per 
frame. Figure 6.24 illustrates the typical line 
timing. Horizontal count 0 corresponds to the 
start of active video, and a horizontal count of 
948 corresponds to the start of horizontal 
blanking. 


DIGITAL 
BLANKING 


DIGITAL ACTIVE LINE 


Sampling is along the +U and +V axes (0°, 
90°, 180°, and 270°), with the sampling phase 
at horizontal count 0 of line 1, Field 1 on the +V 
axis (90°). 

8-bit color burst values are 95, 64, 32, and 
64, continuously repeated. The swinging burst 
causes the peak burst (82 and 95) and zero 
burst (64) samples to change places. The burst 
envelope starts at horizontal count 1058, and 
lasts for 40 clock cycles. 

Sampling is not H-coherent as with (M) 
NTSC, so the position of the sync pulses 
change from line to line. Zero SCH phase is 
defined when alternate burst samples have a 
value of 64. 


187 SAMPLES 
(948-1134) 


TOTAL LINE 


948 SAMPLES 
(0-947) 


1135 SAMPLES 
(0-1134) 


Figure 6.24. Digital Composite (B, D, G, H, 1) PAL Analog and Digital Timing Relationship. 
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Ancillary Data 


Ancillary data packets are used to transmit 
information (such as digital audio, closed cap- 
tioning, and teletext data) during the blanking 
intervals. ITU-R BT.1364 and SMPTE 291M 
describe the ancillary data formats. 

The ancillary data formats are the same as 
for digital component video, discussed earlier 
in this chapter. However, instead of a 3-word 
preamble, a one-word ancillary data flag is 
used, with a 10-bit value of 3FCy. There may 
be multiple ancillary data flags following the 
TRS-ID, with each flag identifying the begin- 
ning of another ancillary packet. 

Ancillary data may be present within the 
following word number boundaries (see Fig- 
ures 6.25 through 6.30). 


END OF 
ANALOG 
LINE 


END OF 
DIGITAL 
LINE 


NTSC PAL 

795-849 972-1035 horizontal sync period 
795-815 972-994 equalizing pulse periods 
340-360 404-426 

795-260 972-302 vertical sync periods 
340-715 404-869 


User data may not use the 10-bit values of 
000y-003y and 3FCy-3FFy, or the 8-bit values 
of 00y and FFy, since they are used for timing 
information. 


25-pin Parallel Interface 


The SMPTE 244M parallel interface is based 
on that used for 27 MHz 4:2:2 digital compo- 
nent video (Table 6.15), except for the timing 
differences. This interface is used to transfer 


Figure 6.25. 


787 


790-794 795-849 


TRS-ID ANC DATA 
(OPTIONAL) 


(M) NTSC TRS-ID and Ancillary Data Locations During Horizontal Sync Intervals. 
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50% 50% 
1 1 1 | 1 
787 790-794 795-260 340-715 
TRS-ID ANC DATA ANC DATA 
(OPTIONAL) (OPTIONAL) 


Figure 6.26. (M) NTSC TRS-ID and Ancillary Data Locations During Vertical Sync Intervals. 


- . ~ 50% / 
A i ry ij 
1 1 1 1 
787 


790-794 795-815 340-360 
TRS-ID ANC DATA ANC DATA 
(OPTIONAL) (OPTIONAL) 


Figure 6.27. (M) NTSC TRS-ID and Ancillary Data Locations During Equalizing Pulse Intervals. 
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END OF END OF 
ANALOG DIGITAL 
LINE LINE 


948 954 
957 
50% 
958 
1 | 1 | 1 | 
962 


967-971 972-1035 


TRS-ID ANC DATA 
(OPTIONAL) 


Figure 6.28. (B, D, G, H, 1) PAL TRS-ID and Ancillary Data Locations During Horizontal 
Sync Intervals. 


50% 50% 
1 1 1 1 
962 967-971 972-302 404-869 
TRS-ID ANC DATA ANC DATA 
(OPTIONAL) (OPTIONAL) 


Figure 6.29. (B, D, G, H, |) PAL TRS-ID and Ancillary Data Locations During Vertical Sync Intervals. 
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_ 50% f \ 50% 
KH 
962 


967-971 972-994 


TRS-ID ANC DATA 
(OPTIONAL) 


404-426 


ANC DATA 
(OPTIONAL) 


Figure 6.30. (B, D, G, H, 1) PAL TRS-ID and Ancillary Data Locations During Equalizing 


Pulse Intervals. 


SDTV resolution digital composite data. 8-bit 
or 10-bit data and a 4x Fsc clock are trans- 
ferred. 

Signal levels are compatible with ECL- 
compatible balanced drivers and_ receivers. 
The generator must have a balanced output 
with a maximum source impedance of 110 Q; 
the signal must be 0.8-2.0V peak-to-peak mea- 
sured across a 110-Q load. At the receiver, the 
transmission line must be terminated by 110 
+10Q. 

The clock signal is a 4x Fsc square wave, 
with a clock pulse width of 35 +5 ns for (M) 
NTSC or 28 +5 ns for (B, D, G, H, I) PAL. The 
positive transition of the clock signal occurs 
midway between data transitions with a toler- 
ance of +5 ns (as shown in Figure 6.31). 

To permit reliable operation at intercon- 
nect lengths of 50-200 meters, the receiver 
must use frequency equalization, with typical 
characteristics shown in Figure 6.3. This 
example enables operation with a range of 
cable lengths down to zero. 


Serial Interface 


The parallel format can be converted to a 
SMPTE 259M serial format (Figure 6.32), 
allowing data to be transmitted using a 75-Q 
coaxial cable (or optical fiber). This interface 
converts the 14.32 or 17.73 MHz parallel 
stream into a 143 or 177 Mbps serial stream. 
The 10x PLL generates the 143 or 177 MHz 
clock from the 14.32 or 17.73 MHz clock sig- 
nal. 

For cable interconnect, the generator has 
an unbalanced output with a source impedance 
of 75Q; the signal must be 0.8V +10% peak-to- 
peak measured across a 75-Q load. The 
receiver has an input impedance of 75Q.. 

The 10 bits of data are serialized (LSB 
first) and processed using a scrambled and 
polarity-free NRZI algorithm: 


G(x) = (x9 +x44+1)(« 41) 


This algorithm is the same as used for digital 
component video discussed earlier. In an 8-bit 
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10-BIT 
DIGITAL 
COMPOSITE 
VIDEO 


4X FSC 
CLOCK 


CLOCK 


TD 


DATA 


X 


TW 
TC = 


75-OHM 
COAX 


TRS 
ID 
INSERTION 
10 
—f~H+ SHIFT L_.! scramBLer 
REGISTER 
10X 40X FSC 
PLL CLOCK 


= 35 + 5 NS (M)NTSC; 28 + 5 NS (B,D, G,H, |) PAL 
69.84 NS (M) NTSC; 56.39 NS (B, D, G, H, I) PAL 
TD = 35 + 5 NS (M)NTSC; 28 + 5 NS (B,D,G,H, 1) PAL 


DESCRAMBLER 


Figure 6.31. Digital Composite Video Parallel Interface Waveforms. 


SHIFT 
REGISTER 


TRS-ID 
DETECT 


40X FSC 
PLL 


DIVIDE 
BY 10 


Figure 6.32. Serial Interface Block Diagram. 


10-BIT 
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COMPOSITE 
VIDEO 
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environment, 8-bit data is appended with two 
least significant “O” bits before serialization. 

The input signal to the scrambler (Figure 
6.7) uses positive logic (the highest voltage 
represents a logical one; lowest voltage repre- 
sents a logical zero). The formatted serial data 
is output at the 40x Fg¢ rate. 

At the receiver, phase-lock synchronization 
is done by detecting the TRS-ID sequences. 
The PLL is continuously adjusted slightly each 
scan line to ensure that these patterns are 
detected and to avoid bit slippage. The recov- 
ered 10x clock is divided by ten to generate the 
4x Fgc sample clock. The serial data is low- 
and high-frequency equalized, inverse scram- 
bling performed (Figure 6.8), and deserialized. 


TRS-ID 

When using the serial interface, a special five- 
word sequence, known as the TRS-ID, must be 
inserted into the digital video stream during 
the horizontal sync time. The TRSID is 
present only following sync leading edges 
which identify a horizontal transition, and 


occupies horizontal counts 790-794, inclusive 
(NTSC) or 967-971, inclusive (PAL). Table 
6.23 shows the TRS-ID format; Figures 6.25 
through 6.30 show the TRS-ID locations for 
digital composite (M) NTSC and (B, D, G, H, 
I) PAL video signals. 

The line number ID word at horizontal 
count 794 (NTSC) or 971 (PAL) is defined as 
shown in Table 6.24. 

PAL requires the reset of the TRS-ID posi- 
tion relative to horizontal sync on lines 1 and 
314 due to the 25-Hz offset. All lines have 1135 
samples except lines 313 and 625, which have 
1137 samples. The two additional samples on 
lines 313 and 625 are numbered 1135 and 1136, 
and occur just prior to the first active picture 
sample (sample 0). 

Due to the 25-Hz offset, the samples occur 
slightly earlier each line. Initial determination 
of the TRS-ID position should be done on line 
1, Field 1, or a nearby line. The TRS-ID loca- 
tion always starts at sample 967, but the dis- 
tance from the leading edge of sync varies due 
to the 25-Hz offset. 


‘easel D8 D7 D6 D5 D4 D3 D2 D1 DO 
TRS word 0 1 1 1 1 1 1 1 1 1 1 
TRS word 1 0 0 0 0 0 0 0 0 0 0 
TRS word 2 0 0 0 0 0 0 0 0 0 0 
TRS word 3 0 0 0 0 0 0 0 0 0 0 
line number ID D8 EP line number ID 


Notes: 
EP = even parity for DO-D7. 


Table 6.23. TRS-ID Format. 
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D2 Di Do (M) NTSC (B, D, G, H, 1) PAL 

0 (0) (0) line 1-263 field 1 line 1-313 field 1 

(0) 0 1 line 264-525 field 2 line 314-625 field 2 

(0) 1 (0) line 1-263 field 3 line 1-313 field 3 

0 1 1 line 264-525 field 4 line 314-625 field 4 

1 0 0 not used line 1-313 field 5 

1 (0) 1 not used line 314-625 field 6 

1 1 0 not used line 1-313 field 7 

1 1 1 not used line 314-625 field 8 
D7-D3 (M) NTSC (B, D, G, H, 1) PAL 

1<x<30 line number 1-30 [264-293] line number 1-30 [314-343] 
x=31 line number > 31 [294] line number > 31 [344] 
x=0 not used not used 


Table 6.24. Line Number ID Word at Horizontal Count 794 (NTSC) or 971 (PAL). 
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Serial Data Transport Interface (SDTI) 


SMPTE 305M and ITU-R BT:1381 define a 
Serial Data Transport Interface (SDTI) that 
enables transferring data between equipment. 
The physical layer uses the 270 or 360 Mbps 
BT.656 and SMPTE 259M digital component 
video serial interface. Figure 6.33 illustrates 
the signal format. 

A 53-word header is inserted immediately 
after the EAV sequence, specifying the source, 
destination, and data format. Table 6.25 illus- 
trates the header contents. 

The payload data is defined by other appli- 
cation-specific standards, such as SMPTE 


326M. It may consist of MPEG 2 program or 
transport streams, DV streams, etc., and uses 
either 8-bit words plus even parity and D8 or 9- 
bit words plus D8. 


Line Number 

The line number specifies a value of 1-525 
(525-line systems) or 1-625 (625-line systems). 
L0 is the least significant bit. 


Line Number CRC 

The line number CRC applies to the data ID 
through the line number, for the entire 10 bits. 
CO is the least significant bit. It is an 18-bit 
value, with an initial value set to all ones: 


CRC =x}8 + x°4+x44x1 
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HEADER 


<>m 
<> >on 


USER DATA 
(PAYLOAD) 


Figure 6.33. SDTI Signal Format. 


Code and AAI 

The 4-bit code value (CD3-CD0) specifies the 
length of the payload (the user data contained 
between the SAV and EAV sequences): 


0000 4:2:2 YCbCr video data 
0001 1440 word payload 
(uses 270 Mbps interface) 
0010 1920 word payload 
(uses 360 Mbps interface) 
1000 143 Mbps digital composite video 


The 4-bit authorized address identifier (AAI) 
value, AAI3—AAIO, specifies the format of the 
destination and source addresses: 


0000 unspecified format 
0001 IPv6 address 


Destination and Source Addresses 

These specify the address of the source and 
destination devices. A universal address is indi- 
cated when all address bits are zero and AAI3— 
AAIO = 0000. 


Block Type 

The block type value specifies the segmenta- 
tion of the payload. BL7-BL6 indicate the pay- 
load block structure: 


00 fixed block size without ECC 
01 fixed block size with ECC 

10 ~—_— unassigned 

11 variable block size 


BL5-BLO indicate the segmentation for fixed 
block sizes. Variable block sizes are indicated 
by BL7-BLO having a value of 11000001. The 
ECC format is application-dependent. 


Payload CRC Flag 
The CRCF bit indicates whether or not the pay- 
load CRC is present at the end of the payload: 


0 no CRC 
1 CRC present 
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10-bit Data 
hie D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 0 0 0 0 0 0 0 0 
ancillary data 
flag (ADF) 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
data ID (DID) D8 EP 0 1 0 0 (0) 0 0 0 
SDID D8 EP 0 0 0 0 0 0 0 1 
data count (DC) D8 EP 0 0 1 (0) 1 1 1 0 
D8 EP L7 16 L5 1A 13 12 Ll LO 
line number = 
D8 EP 0 0 0 0 0 0 19 18 
‘ D8 C8 C7 C6 C5 C4 C3 C2 Cl CO 
line number 
CRC D8 C17 C16 C15 C14 C13 C12 Cll C10 (o) 
code and AAI D8 EP AAI3 AAI2 AATI AAIO CD3 CD2 CD1 CDO 
D8 EP DA7 DA6 DA5 DA4 DA3 DA2 DAI DAO 
destinati D8 EP DAI5 DA14 DA13 DA12 DAI1 DA10 DAY DA8 
lestination 
address 
D8 EP DA127 | DA126 | DA125 | DA124 | DA123 | DA122 | DA121 | DA120 
D8 EP SA7 SA6 SA5 SA4 SA3 SA2 SA1 SAO 
D8 EP SA15 SA14 SA13 SA12 SA11 SA10 SA9 SA8 
source 
address 
D8 EP SA127 | SA126 | SA125 | SA124 | SA123 | SA122 | SA121 | SA120 
Notes: 


EP = even parity for DO-D7. 


Table 6.25a. SDTI Header Structure. 
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10-bit Data 
De D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 

block type D8 EP BL7 BL6 BL5 BIA BL3 BL2 BL1 BLO 
payload CRC flag D8 EP 0 0 0 0 0 0 0 CRCF 
reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 

D8 C8 C7 C6 C5 C4 C3 C2 C1 (o) 
header CRC —= 

D8 C17 C16 C15 C14 C13 C12 Cll C10 C9 

aa Sum of D0-D8 of data ID through last header CRC word. 
check sum D8 sae 

Preset to all zeros; carry is ignored. 
Notes: 


EP = even parity for DO-D7. 


Table 6.25b. SDTI Header Structure. 


Header CRC 

The header CRC applies to the code and AAT 
word through the reserved data, for the entire 
10 bits. CO is the least significant bit. It is an 18- 
bit value, with an initial value set to all ones: 


CRC = x8 +x°4x44x1 


144 Chapter 6: Digital Video Interfaces 


High Data-Rate Serial Data Transport 
Interface (HD-SDTI) 


SMPTE 348M defines a High Data-Rate Serial 
Data Transport Interface (HD-SDTI) that 
enables transferring data between equipment. 
The physical layer uses the 1.485 (or 1.485/ 
1.001) Gbps SMPTE 292M digital component 
video serial interface. 

Figure 6.34 illustrates the signal format. 
Two data channels are multiplexed onto the 
single HD-SDTI stream such that one 74.25 (or 
74.25/1.001) MHz data stream occupies the Y 
data space and the other 74.25 (or 74.25/1.001) 
MHz data stream occupies the CbCr data 
space. 

A 49-word header is inserted immediately 
after the line number CRC data, specifying the 
source, destination, and data format. Table 
6.26 illustrates the header contents. 


The payload data is defined by other appli- 
cation-specific standards. It may consist of 
MPEG 2 program or transport streams, DV 
streams, etc., and uses either 8-bit words plus 
even parity and D8 or 9-bit words plus D8. 


Code and AAI 

The 4-bit code value (CD3-CD0) specifies the 
length of the payload (the user data contained 
between the SAV and EAV sequences): 


0000 
0001 
0010 
0011 
1000 
1001 
1010 
1011 
1100 
1101 
1110 
1111 


4:2:2 YCbCr video data 

1440 word payload 

1920 word payload 

1280 word payload 

143 Mbps digital composite video 
2304 word payload (extended mode) 
2400 word payload (extended mode) 
1440 word payload (extended mode) 
1728 word payload (extended mode) 
2880 word payload (extended mode) 
3456 word payload (extended mode) 
3600 word payload (extended mode) 


a 


HEADER 


<>m 
a) 


<>on 


USER DATA 
(PAYLOAD) 


C CHANNEL 


a 


HEADER 


<>m 
i) 


<>on 


USER DATA 
(PAYLOAD) 


Y CHANNEL 


Figure 6.34. HD-SDTI Signal Format. 
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10-bit Data 
atid D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 
0 0 0 0 0 0 0 0 0 0 
ancillary data 
flag (ADF) 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 

data ID (DID) D8 EP 0 1 0 0 0 0 0 0) 
SDID D8 EP (0) 0 0 0 0 0 1 0 
data count (DC) D8 EP 0 0 1 0 1 0 1 0 
code and AAI D8 EP AAI3 AAI2 AATL AAIO CD3 CD2 CD1 CDO 

D8 EP DA7 DA6 DA5 DA4 DA3 DA2 DA1 DAO 
destinati D8 EP DA15 DA14 DAI13 DA12 DAI1 DA10 DAY DA8 

estination 

address 

D8 EP DA127 | DA126 | DA125 | DA124 | DA123 | DA122 | DA121 | DA120 

D8 EP SA7 SA6 SA5 SA4 SA3 SA2 SA1 SAO 

D8 EP SA15 SA14 SA13 SA12 SA11 SA10 SA9 SA8 
source 
address 

D8 EP SA127 | SA126 | SA125 | SA124 | SA123 | SA122 | SA121 | SA120 
block type D8 EP BL7 BL6 BL5 BIA BL3 BL2 BL1 BLO 
reserved D8 EP 0 0 0 0 0 (0) 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 

Notes: 


EP = even parity for DO-D7. 


Table 6.26a. HD-SDTI Header Structure. 
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10-bit Data 
pe D8 D7 D6 D5 D4 D3 D2 D1 DO 
(MSB) 

reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 
reserved D8 EP 0 0 0 0 0 0 0 0 

D8 C8 C7 C6 C5 C4 C3 C2 C1 (o) 
header CRC —= 

D8 C17 C16 C15 C14 C13 C12 Cll C10 (o) 

ao Sum of DO-D8 of data ID through last header CRC word. 
check sum D8 cate 

Preset to all zeros; carry is ignored. 
Notes: 


EP = even parity for DO-D7. 


Table 6.26b. HD-SDTI Header Structure. 


The extended mode advances the timing of 
the SAV sequence, shortening the blanking 
interval, so that the payload data rate remains 
aconstant 129.6 (or 129.6/1.001) MBps. 

The 4bit authorized address identifier 
(AAI) format is the same as for SMPTE 305M. 


Destination and Source Addresses 
The source and destination address formats 
are the same as for SMPTE 305M. 


Block Type 

The block type format is the same as for 
SMPTE 305M. However, different payload 
segmentations are used for a given fixed block 
type value. 


Header CRC 
The header CRC format is the same as for 
SMPTE 305M. 
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YCbCr Values: 8-bit Data 


Y has a nominal range of 10q—-EBy. Values less 
than 10y or greater than EBy may be present 
due to processing. Cb and Cr have a nominal 
range of 10q-FOy. Values less than 10, or 
greater than F0y may be present due to pro- 
cessing. YCbCr data may not use the values of 
00y and FFy since those values may be used 
for timing information. 

During blanking, Y data should have a 
value of 10, and CbCr data should have a value 
of 801, unless other information is present. 


YCbCr Values: 10-bit Data 


For higher accuracy, pro-video solutions typi- 
cally use 10-bit YCbCr data. Y has a nominal 
range of 040q-3ACy. Values less than 040, or 
greater than 3ACy may be present due to pro- 
cessing. Cb and Cr have a nominal range of 
040y-3C0y. Values less than 040y or greater 
than 3C0y may be present due to processing. 
The values 000q-003y and 3FCy-3FFy may 
not be used to avoid timing contention with 8 
bit systems. 

During blanking, Y data should have a 
value of 0404 and CbCr data should have a 
value of 200y, unless other information is 
present. 


RGB Values: 8-bit Data 


Consumer solutions typically use 8-bit R’G’B’ 
data, with a range of 00q-FFy. During blank- 
ing, R’G’B’ data should have a value of 00y, 
unless other information is present. 

Pro-video solutions that support 8&bit 
R’G’B’ data typically use a range of 10q-EBy. 
Values less than 10y or greater than EBy may 
be present due to processing. During blanking, 


R’G’B’ data should have a value of 10y, unless 
other information is present. 


RGB Values: 10-bit Data 


For higher accuracy, pro-video solutions typi- 
cally use 10-bit R’G’B’ data, with a nominal 
range of 040q-3ACy. Values less than 040, or 
greater than 3ACy may be present due to pro- 
cessing. The values 000q-003y and 3FCy- 
3FFy may not be used to avoid timing conten- 
tion with 8-bit systems. 

During blanking, R’G’B’ data should have 
a value of 040y, unless other data is present. 


“Standard” Video Interface 


The “standard” video interface has been used 
for years, with the control signal names and 
timing reflecting the video standard. Sup- 
ported active resolutions and sample clock 
rates are dependent on the video standard and 
aspect ratio. 

Devices usually support multiple data for- 
mats to simplify using them in a wide variety of 
applications. 


Video Data Formats 

The 24-bit 4:4:4 YCbCr data format is shown in 
Figure 6.35. Y, Cb, and Cr are each 8 bits, and 
all are sampled at the same rate, resulting in 24 
bits of data per sample clock. Pro-video solu- 
tions typically use a 30-bit interface, with the Y, 
Cb, and Cr streams each being 10 bits. YO, 
Cb0, and Cr0 are the least significant bits. 

The 16-bit 4:2:2 YCbCr data format is 
shown in Figure 6.36. Cb and Cr are sampled 
at one-half the Y sample rate, then multiplexed 
together. The CbCr stream of active data 
words always begins with a Cb sample. Pro- 
video solutions typically use a 20-bit interface, 
with the Y and CbCr streams each being 10 
bits. 
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BLANKING ACTIVE VIDEO 


Y 1 


ONE SCAN LINE 


| 24-BIT 
4:4:4 
VIDEO 
Figure 6.35. 24-Bit 4:4:4 YCbCr Data Format. 
BLANKING ACTIVE VIDEO 
16-BIT 
4:2:2 
ONE SCAN LINE VIDEO 
Figure 6.36. 16-Bit 4:2:2 YCbCr Data Format. 
8-BIT 
4:2:2 
VIDEO 


BLANKING ACTIVE VIDEO 


ONE SCAN LINE 


Figure 6.37. 8-Bit 4:2:2 YCbCr Data Format. 
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The 8&-bit 4:2:2 YCbCr data format is shown 
in Figure 6.37. The Y and CbCr streams from 
the 16-bit 4:2:2 YCbCr format are simply mulTti- 
plexed at 2x the sample clock rate. The YCbCr 
stream of active data words always begins with 
a Cb sample. Pro-video solutions typically use a 
10-bit interface. 

Tables 6.27 and 6.28 illustrate the 15-bit 
RGB, 16-bit RGB, and 24-bit RGB formats. For 
the 15-bit RGB format, the unused bit is some- 
times used for keying (alpha) information. RO, 
GO, and BO are the least significant bits. 


Control Signals 
In addition to the video data, there are four 
control signals: 


HSYNC# horizontal sync 
VSYNC# vertical sync 
BLANK# blanking 

CLK 1x or 2x sample clock 


For the 8bit and 10-bit 4:2:2 YCbCr data 
formats, CLK is a 2x sample clock. For the 
other data formats, CLK is a 1x sample clock. 
For sources, the control signals and video data 
are output following the rising edge of CLK. 
For receivers, the control signals and video 
data are sampled on the rising edge of CLK. 

While BLANK# is negated, active R’G’B’ 
or YCbCr video data is present. 

To support video sources that do not gen- 
erate a line-locked clock, a DVALID# (data 
valid) signal may also be used. While 
DVALID# is asserted, valid data is present. 

HSYNC¢# is asserted during the horizontal 
sync time each scan line, with the leading edge 
indicating the start of a new line. The amount 
of time that HSYNC# is asserted is usually the 
same as that specified by the video standard. 


VSYNC# is asserted during the vertical 
sync time each field or frame, with the leading 
edge indicating the start of a new field or 
frame. The number of scan lines that VSYNC# 
is asserted is usually same as that specified by 
the video standard. 

For interlaced video, if the leading edges of 
VSYNC# and HSYNC# are coincident, the field 
is Field 1. If the leading edge of VSYNC# 
occurs mid-line, the field is Field 2. For nonin- 
terlaced video, the leading edge of VSYNC# 
indicates the start of a new frame. Figure 6.38 
illustrates the typical HSYNC# and VSYNC# 
relationships. 


Receiver Considerations 

Assumptions should not be made about the 
number of samples per line or horizontal 
blanking interval. Other wise, the implementa- 
tion may not work with all sources. 

To ensure compatibility between various 
sources, horizontal counters should be reset 
by the leading edge of HSYNC#, not by the 
trailing edge of BLANK#. 

To handle real-world sources, a receiver 
should use a “window” for detecting whether 
Field 1 or Field 2 is present. For example, if the 
leading edge of VSYNC# occurs within +64 1x 
clock cycles of the leading edge of HSYNC#, 
the field is Field 1. Otherwise, the field is Field 
2. 

Some video sources indicate sync timing 
by having Y data be an 8-bit value less than 
10y. However, most video ICs do not do this. In 
addition, to allow real-world video and test sig- 
nals to be passed through with minimum dis- 
ruption, many ICs now allow the Y data to have 
a value less than 10y during active video. Thus, 
receiver designs assuming sync timing is 
present on the Y channel may no longer work. 
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24-bit 16-bit 15-bit 24-bit 16-bit 8-bit 
RGB RGB RGB 4:4:4 4:2:2 4:2:2 
(5,6,5) (5,5,5) YCbCr YCbCr YCbCr 

R7 Cr7 

R6 Cr6 

Rd Cr5 

R4 Cr4 

R3 Cr3 

R2 Cr2 

R1 Cr1 

RO Cr0 

G7 R4 - Y7 Y7 Cb7, Y7, Cr7 
G6 R3 R4 Y6 Y6 Cb6, Y6, Cr6 
G5 R2 R3 YS YS Cb5, Y5, Cr5 
G4 R1 R2 Y4 Y4 Cb4, Y4, Cr4 
G3 RO R1 Y3 Y3 Cb3, Y3, Cr3 
G2 G5 RO Y2 Y2 Cb2, Y2, Cr2 
Gl G4 G4 Yl Yl Cbl, Y1, Cr1 
GO G3 G3 YO YO Cb0, YO, Cr0 
B7 G2 G2 Cb7 Cb7, Cr7 

B6 G1 G1 Cb6 Cb6, Cr6 

BS GO GO Cb5 Cb5, Cr5 

B4 B4 B4 Cb4 Cb4, Cr4 

B3 B3 B3 Cb3 Cb3, Cr3 

B2 B2 B2 Cb2 Cb2, Cr2 

Bl Bl Bl Cb1 Cbl, Cr1 

BO BO BO Cb0 Cb0, Cr0 


Table 6.27. Transferring YCbCr and RGB Data over a 16-bit or 24-bit Interface. 
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24-bit 16-bit 15-bit 24-bit 16-bit 8-bit 
RGB RGB RGB 4:4:4 4:2:2 4:2:2 
(5,6,5) (5,5,5) YCbCr YCbCr YCbCr 
R4 - Y7 
R3 R4 Y6 
R2 R3 YS 
R1 R2 Y4 
RO R1 Y3 
G5 RO Y2 
G4 G4 Yl 
G3 G3 YO 
R7 G2 G2 Cr7 Cb7, Cr7 
R6 Gl Gl Cr6 Cb6, Cr6 
Rd GO GO Cr5 Cb5, Crd 
R4 B4 B4 Cr4 Cb4, Cr4 
R3 B3 B3 Cr3 Cb3, Cr3 
R2 B2 B2 Cr2 Cb2, Cr2 
R1 Bl Bl Cr1 Chl, Cr1 
RO BO BO Cr0 Cb0, Cr0 
G7 R4 - Y7 Y7 Cb7, Y7, Cr7 
G6 R3 R4 Y6 Y6 Cb6, Y6, Cr6 
G5 R2 R3 YS YS Cb5, Y5, Cr5 
G4 R1 R2 Y4 Y4 Cb4, Y4, Cr4 
G3 RO R1 Y3 Y3 Cb3, Y3, Cr3 
G2 G5 RO Y2 Y2 Cb2, Y2, Cr2 
Gl G4 G4 Yl Yl Cbl, Y1, Cr1 
GO G3 G3 YO YO Cb0, YO, Cr0 
B7 G2 G2 Cb7 Cb7, Cr7 
B6 Gl G1 Cb6 Cb6, Cr6 
BS GO GO Cb5 Cb5, Crd 
B4 B4 B4 Cb4 Cb4, Cr4 
B3 B3 B3 Cb3 Cb3, Cr3 
B2 B2 B2 Cb2 Cb2, Cr2 
Bl Bl Bl Cb1 Chl, Cr1 
BO BO BO Cb0 Cb0, Cr0 


Table 6.28. Transferring YCbCr and RGB Data over a 32-bit Interface. 
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Figure 6.38. Typical HSYNC# and VSYNC# Relationships (Not to Scale). 


Video Module Interface (VMI) 


VMI (Video Module Interface) was developed 
in cooperation with several multimedia IC 
manufacturers. The goal was to standardize 
the video interfaces between devices such as 
MPEG decoders, NTSC/PAL decoders, and 
graphics chips. 


Video Data Formats 

The VMI specification specifies an 8-bit 4:2:2 
YCbCr data format as shown in Figure 6.39. 
Many devices also support the other YCbCr 
and R’G’B’ formats discussed in the “Standard 
Video Interface” section. 


Control Signals 
In addition to the video data, there are four 
control signals: 


HREF horizontal blanking 
VREF vertical sync 
VACTIVE active video 
PIXCLK 2x sample clock 


For the 8-bit and 10-bit 4:2:2 YCbCr data 
formats, PIXCLK is a 2x sample clock. For the 
other data formats, PIXCLK is a 1x sample 
clock. For sources, the control signals and 
video data are output following the rising edge 
of PIXCLK. For receivers, the control signals 
and video data are sampled on the rising edge 
of PIXCLK. 
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While VACTIVE is asserted, active R’G’B’ 
or YCbCr video data is present. Although tran- 
sitions in VACTIVE are allowed, it is intended 
to allow a hardware mechanism for cropping 
video data. For systems that do not support a 
VACTIVE signal, HREF can generally be con- 
nected to VACTIVE with minimal loss of func- 
tion. 

To support video sources that do not gen- 
erate a line-locked clock, a DVALID# (data 
valid) signal may also be used. While 
DVALID# is asserted, valid data is present. 

HREF is asserted during the active video 
time each scan line, including during the verti- 
cal blanking interval. 

VREF is asserted for 6 scan line times, 
starting one-half scan line after the start of ver- 
tical sync. 

For interlaced video, the trailing edge of 
VREF is used to sample HREF If HREF is 
asserted, the field is Field 1. If HREF is 


BLANKING 


1 1 1 1 1 1 17 C;}Y}]C;]Y]C}]Y]cCc!yY 
0 0 0) 0 0 0) oF By oo}; R| 1 By] 2] R{ 3 
0) 0) 2 2 
|—* ~|~ 


negated, the field is Field 2. For noninterlaced 
video, the leading edge of VREF indicates the 
start of a new frame. Figure 6.40 illustrates the 
typical HREF and VREF relationships. 


Receiver Considerations 

Assumptions should not be made about the 
number of samples per line or horizontal 
blanking interval. Other wise, the implementa- 
tion may not work with all sources. 

Video data has input setup and hold times, 
relative to the rising edge of PIXCLK, of 5 and 
0 ns, respectively. 

VACTIVE has input setup and hold times, 
relative to the rising edge of PIXCLK, of 5 and 
0 ns, respectively. 

HREF and VREF both have input setup 
and hold times, relative to the rising edge of 
PIXCLK, of 5 and 5 ns, respectively. 


HREF 


ACTIVE VIDEO 


c Y 
R N 
Inn] IN] 


| 


ONE SCAN LINE 


Figure 6.39. VMI 8-bit 4:2:2 YCbCr Data for One Scan Line. 
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Figure 6.40. VMI Typical HREF and VREF Relationships (Not to Scale). 


“BT.656” Interface 


The BT-656 interface for ICs is based on the 
pro-video BT.656-type parallel interfaces, dis- 
cussed earlier in this chapter (Figures 6.1 and 
6.9). Using EAV and SAV sequences to indicate 
video timing reduces the number of pins 
required. The timing of the H, V, and F signals 
for common video formats is illustrated in 
Chapter 4. 

Standard IC signal levels and timing are 
used, and any resolution can be supported. 


Video Data Formats 
8-bit or 10-bit 4:2:2 YCbCr data is used, as 
shown in Figures 6.1 and 6.9. Although 


sources should generate the four protection 
bits in the EAV and SAV sequences, receivers 
may choose to ignore them due to the reliabil- 
ity of point-to-point transfers between chips. 


Control Signals 

CLK is a 2x sample clock. For sources, the 
video data is output following the rising edge 
of CLK. For receivers, the video data is sam- 
pled on the rising edge of CLK. 

To support video sources that do not gen- 
erate a line-locked clock, a DVALID# (data 
valid) signal may also be used. While 
DVALID# is asserted, valid data is present. 
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Zoomed Video Port (ZV Port) 


Used on laptops, the ZV Port is a point-to-point 
uni-directional bus between the PC Card host 
adaptor and the graphics controller. It enables 
video data to be transferred real-time directly 
from the PC Card into the graphics frame 
buffer. 

The PC Card host adaptor has a special 
multimedia mode configuration. If a non-ZV PC 
Card is plugged into the slot, the host adaptor 
is not switched into the multimedia mode, and 
the PC Card behaves as expected. Once a ZV 
card has been plugged in and the host adaptor 
has been switched to the multimedia mode, 
the pin assignments change. As shown in 
Table 6.29, the PC Card signals A6—A25, 
SPKR#, INPACK#, and IOIS16# are replaced 
by ZV Port video signals (YO-Y7, CbCr0- 
CbCr7, HREF, VREF, and PCLK) and 4-chan- 


nel audio signals (MCLK, SCLK, LRCK, and 
SDATA). 


Video Data Formats 
16-bit 4:2:2 YCbCr data is used, as shown in 
Figure 6.36. 


Control Signals 
In addition to the video data, there are four 
control signals: 


HREF horizontal reference 
VREF vertical sync 
PCLK 1x sample clock 


HREF, VREF, and PCLK have the same 
timing as the VMI interface discussed earlier 
in this chapter. 


PC Card ZV Port PC Card ZV Port PC Card ZV Port 
Signal Signal Signal Signal Signal Signal 
A25 CbCr7 Al7 Yl AQ YO 
A24 CbCr5 Al6 CbCr2 A8 Y2 
A23 CbCr3 Al5 CbCr4 AZT SCLK 

A22 CbCr1 Al4 Y6 A6 MCLK 
A21 CbCr0 Al3 y4 SPKR# SDATA 
A20 Y7 Al2 CbCr6 IOIS16# PCLK 
Alg9 Y5 All VREF INPACK# LRCK 

Als Y3 Al0 HREF 


Table 6.29. PC Card vs. ZV Port Signal Assignments. 
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Video Interface Port (VIP) 


The VESA VIP specification is an enhancement 
to the “BT:656” interface for ICs, previously 
discussed. The primary application is to inter- 
face up to four devices to a graphics controller 
chip, although the concept can easily be 
applied to other applications. 

There are three sections to the interface: 


Host Interface: 
VIPCLK host clock 
HADO-HAD7 host address/data bus 
HCTL host control 


Video Interface: 


PIXCLK 
VIDO-VID15 


video sample clock 
video data bus 


System Interface: 


VRST# 
VIRQ# 


reset 
interrupt request 


The host interface signals are provided by 
the graphics controller. Essentially, a 2-, 4-, or 
8-bit version of the PCI interface is used. VIP- 
CLK has a frequency range of 25-33 MHz. PIX- 
CLK has a maximum frequency of 75 MHz. 


Video Interface 

As with the “BT:656” interface, special four- 
word sequences are inserted into the &bit 
4:2:2 YCbCr video stream to indicate the start 
of active video (SAV) and end of active video 
(EAV). These sequences also indicate when 
horizontal and vertical blanking are present 
and which field is being transmitted. 

VIP modifies the BT.656 EAV and SAV 
sequences as shown in Table 6.30. BT.656 uses 
four protection bits (PO—-P3) in the status word 
since it was designed for long cable connec- 


tions between equipment. With chip-to-chip 
interconnect, this protection isn’t required, so 
the bits are used for other purposes. The tim- 
ing of the H, V, and F signals for common video 
formats are illustrated in Chapter 4. The status 
word for VIP is defined as: 


T=“0" fortaskB T= “1” for task A 
F = “0” for Field 1 F = “1” for Field 2 
V = “1” during vertical blanking 

H = “0” at SAV H = “1” at EAV 


The task bit, T, is programmable. If BT.656 
compatibility is required, it should always be a 
“1.” Otherwise, it may be used to indicate 
which one of two data streams are present: 
stream A = “1” and stream B = “0.” Alternately, 
T may be a “0” when raw 2x oversampled VBI 
data is present, and a “1” otherwise. 

The noninterlaced bit, N, indicates 
whether the source is progressive (“1”) or 
interlaced (“0”). This bit is valid only during 
the EAV sequence of the last active line. 

The repeat bit, R, is a “1” if the current 
field is a repeat field. This occurs only during 
3:2 pull-down. This bit is valid only during the 
EAV sequence of the last active line. The 
repeat bit (R), in conjunction with the noninter- 
laced bit (N), enables the graphics controller 
to handle Bob and Weave, as well as 3:2 pull- 
down, (further discussed in Chapter 7) in 
hardware. 

The extra flag bit, E, is a “1” if another byte 
follows the EAV. Table 6.31 illustrates the extra 
flag byte. This bit is valid only during EAV 
sequences. If the E bit in the extra byte is “1,” 
another extra byte immediately follows. This 
allows chaining any number of extra bytes 
together as needed. 

Unlike pro-video interfaces, code 00q may 
used during active video data to indicate an 
invalid video sample. This is used to accommo- 
date scaled video and square pixel timing. 
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8-bit Data 
D7 
(MSB) D6 D5 D4 D3 D2 D1 DO 
1 1 1 1 1 1 1 1 
preamble 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 
status word T F Vv H N R 0 E 
Table 6.30. VIP EAV and SAV Sequence. 
8-bit Data 
D7 
(MSB) DG D5 D4 D3 D2 D1 DO 
extra byte 1 0 reserved E 
Table 6.31. VIP EAV Extra Byte. 
Video Data Formats Ancillary Data 


In the 8-bit mode (Figure 6.41), the video inter- 
face is similar to BT.656, except for the differ- 
ences mentioned. VID8-VID15 are not used. 

In the 16-bit mode (Figure 6.42), SAV 
sequences, EAV sequences, ancillary packet 
headers, CbCr video data, and odd-numbered 
ancillary data values are transferred across the 
lower 8 bits (VIDO-VID7). Y video data and 
even-numbered ancillary data values are trans- 
ferred across the upper 8 bits (VID8-VID15). 
Note that “skip data” (value 00}) during active 
video must also appear in 16-bit format to pre- 
serve the 16-bit data alignment. 


Ancillary data packets are used to transmit 
information (such as digital audio, closed cap- 
tioning, and teletext data) during the blanking 
intervals, as shown in Table 6.32. Unlike pro- 
video interfaces, the 00y and FFy values may 
be used by the ancillary data. Note that the 
ancillary data formats were defined prior to 
many of the pro-video ancillary data formats, 
and therefore may not match. 

I2 of the DID indicates whether Field 1 or 
Field 2 ancillary data is present: 


0 Field 1 
1 Field 2 
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H SIGNAL 


START OF DIGITAL ACTIVE LINE NEXT LINE 


START OF DIGITAL LINE 


EAV CODE BLANKING SAV CODE CO-SITED CO-SITED 
VIP 
4:2:2 
VIDEO 
Figure 6.41. VIP 8-Bit Interface Data for One Scan Line. 525-line; 720 active 
samples per line; 27 MHz clock. 
H SIGNAL 
NEXT LINE 


START OF DIGITAL ACTIVE LINE 


YT Y;]Y}Y] YY]; YY] Y 
Oo; 1/2) 3); 4] 5] 6] 7 


1920 


START OF DIGITAL LINE 


SAV CODE 


EAV CODE BLANKING 


Y Y Y 
1918 | 1919 
CHANNEL 


EAV CODE BLANKING SAV CODE 
F x F x Cc c | c le 
F Y F Y R B R JF CBCR 
z Zz 6 1918 | 1918 CHANNEL 


Figure 6.42. VIP 16-Bit Interface Data for One Scan Line. 1125-line; 1920 active 
samples per line; 74.25 MHz clock. 


como 
ono 
wwO 
wDO 
AWO 
ADO 
ono 
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8-bit Data 
pad D6 D5 D4 D3 D2 D1 DO 
(MSB) 

0 0 0 0 0 0 0 0 
aon 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 
data ID (DID) D8 EP 0 1 0 12 I 10 
SDID D8 EP programmable value 
data count (DC) D8 EP DC5 DC4 DC3 DC2 DC1 DCO 
internal header 0 0 0 0 0 DT3 DT2 DT1 DT0 
internal header 1 0 0 LN5 LN4 LN3 LN2 LN1 LNO 
user data word 0 D7 D6 D5 D4 D3 D2 D1 DO 
user data word N D7 D6 D5 D4 D3 D2 D1 DO 
check sum D8 EP CS5 CS4 CS3 CS2 CS1 CSO 
optional fill data D8 EP 0 0 0 0 0 0 
Notes: 


EP = even parity for DO-D5. 


Table 6.32. VIP Ancillary Data Packet General Format. 


I1-I0 of the DID indicate the type of ancil- 
lary data present: 


00 — start of field 

01 sliced VBI data, lines 1-23 

10 end of field VBI data, line 23 

11 sliced VBI data, line 24 to end of field 


The data count value (DC) specifies the 
number of D-words (4-byte blocks) of ancillary 
data present. Thus, the number of data words 


in the ancillary packet after the DID must bea 
multiple of four. 1-3 optional fill bytes may be 
added after the check sum data to meet this 
requirement. 

When I1-I0 are “00” or “10,” no user data is 
present, and the data count (DC) value should 
be “00000.” 
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Consumer Component 
Interfaces 


Digital Visual Interface (DVI) 


DVI was developed for transferring uncom- 
pressed digital video from a computer to a dis- 
play monitor. It may also be used for 
interfacing devices such as settop boxes to 
televisions. DVI enhances the Digital Flat 
Panel (DFP) Interface by supporting more for- 
mats and timings, and supporting the High- 
bandwidth Digital Content Protection (HDCP) 
specification to ensure unauthorized copying 
of material is prevented. The interface sup- 
ports VESA’s Extended Display Identification 
Data (EDID) standard, Display Data Channel 
(DDC) standard, and Monitor Timing Specifi- 
cation (DMT). DDC and EDID enable auto- 
matic display detection and configuration. 
“TFT data mapping” is supported as the mini- 


TMDS TMDS 


mum requirement: one pixel per clock, eight 
bits per channel, MSB justified. 

DVI uses transition-minimized differential 
signaling (TMDS). Eight bits of video data are 
converted to a 10-bit transition-minimized, DC- 
balanced value, which is then serialized. The 
receiver deserializes the data, and converts it 
back to eight bits. Thus, to transfer digital 
R’G'B’ or YCbCr data requires three TMDS 
signals that comprise one TMDS link. 

To further enhance DVI for the consumer 
market, Silicon Image developed a method of 
transferring digital audio over the existing 
clock channel. 


TMDS Links 

Either one or two TMDS links may be used, as 
shown in Figures 6.43 and 6.44, depending on 
the formats and timing required. A system sup- 
porting two TMDS links must be able to 
switch dynamically between formats requiring 
a single link and formats requiring a dual link. 


TMDS 


TRANSMITTER LINK RECEIVER 
Bo-B7 ——+ >| |BO-B7_, > BO-B7 
vsYNC ——+-=| ENCODER CHANNEL 0 RECEIVER |VSYNC || > VSYNC 
AND > AND HSYNG 
HSYNC_ ——+—) SERIALIZER DECODER iscoameel > HSYNC 
DE — os > DE 
G0-G7 |_| [GO-G7 | > G0-G7 
cto ENCODER CHANNEL 4 RECEIVER |CTLO = eno 
AND > AND CTL INTER 
CTL1 [| SERIALIZER DECODER [oo CHANNEL > CTL 
DE1 
| [>] ALIGNMENT 
RO-R7 || peti > RO-R7 
are ENCODER GuiANNELS RECEIVER |CTL2 = eis 
AND >| AND CTL3 
CTL3 [| SERIALIZER DECODER ——_*| > CTL3 
| [DE2 | 
CHANNEL C 
CLK >| > CLK 


Figure 6.43. DVI Single TMDS Link. 
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Figure 6.44. DVI Dual TMDS Link. 
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A single TMDS link is used to support all 
formats and timings requiring a clock rate of 
25-165 MHz. Formats and timings requiring a 
clock rate >165 MHz are implemented using 
two TMDS links, with each TMDS link operat- 
ing at one-half the frequency. Thus, the two 
TMDS links share the same clock and the 
bandwidth is shared evenly between the two 
links. 


Video Data Formats 

Typically, 24-bit R’'G'B’ data is transferred over 
a link, although any data format may be used, 
including 24-bit YCbCr for consumer applica- 
tions. For applications requiring more than 
eight bits per color component, the second 
TMDS link may be used for the additional least 
significant bits. 


Control Signals 
In addition to the video data, there are up to 14 
control signals: 


HSYNC horizontal sync 
VSYNC vertical sync 

DE data enable 
CTLO-CTL3 _ reserved (link 0) 
CTL4-CTL9 __ reserved (link 1) 
CLK 1x sample clock 


Figure 6.45. DVI Digital-Only Connector. 


While DE is a “1,” active video is pro- 
cessed. While DE is a “0,” the HSYNC, VSYNC 
and CTLO-CTL9 signals are _ processed. 
HSYNC and VSYNC may be either polarity. 


Digital-Only Connector 

The digital-only connector, which supports 
dual link operation, contains 24 contacts 
arranged as three rows of eight contacts, as 
shown in Figure 6.45. Table 6.33 lists the pin 
assignments. 


Digital-Analog Connector 

In addition to the 24 contacts used by the digi- 
tal-only connector, the 29-contact digital-analog 
connector contains five additional contacts to 
support analog video as shown in Figure 6.46. 
Table 6.34 lists the pin assignments. 


HSYNC horizontal sync 
VSYNC vertical sync 

RED analog red video 
GREEN analog green video 
BLUE analog blue video 


The operation of the analog signals is the 
same as for a standard VGA connector. 


Figure 6.46. DVI Digital-Analog Connector. 
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Pin Signal Pin Signal Pin Signal 
1 D2- 9 Di- 17 DO- 
2 D2 10 D1 18 DO 
3 shield 11 shield 19 shield 
4 D4- 12 D3- 20 D5- 
5 D4 13 D3 21 D5 
6 DDC SCL 14 +5V 22 shield 
7 DDC SDA 15 ground 23 CLK 
8 reserved 16 Hot Plug Detect} 24 CLK- 


Table 6.33. DVI Digital-Only Connector Signal Assignments. 


Pin Signal Pin Signal Pin Signal 
1 D2- 9 Di- 17 DO- 
2 D2 10 D1 18 DO 
3 shield 11 shield 19 shield 
4 D4- 12 D3- 20 D5- 
5 D4 13 D3 21 D5 
6 DDC SCL 14 +5V 22 shield 
7 DDC SDA 15 ground 23 CLK 
8 VSYNC 16 | Hot Plug Detect} 24 CLK- 

C1 RED C2 GREEN C3 BLUE 

C4 HSYNC C5 ground 


Table 6.34. DVI Digital-Analog Connector Signal Assignments. 
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Digital Flat Panel (DFP) Interface 


The VESA DFP interface was developed for 
transferring uncompressed digital video from 
a computer to a digital flat panel display. It sup- 
ports VESA’s Plug and Display (P&D) stan- 
dard, Extended Display Identification Data 
(EDID) standard, Display Data Channel 
(DDC) standard, and Monitor Timing Specifi- 
cation (DMT). DDC and EDID enable auto- 
matic display detection and configuration. 
Only “TFT data mapping” is supported: one 
pixel per clock, eight bits per channel, MSB 
justified. 

Like DVI, DFP uses transition-minimized 
differential signaling (TMDS). Eight bits of 
video data are converted to a 10-bit transition- 
minimized, DC-balanced value, which is then 
serialized. The receiver deserializes the data, 
and converts it back to eight bits. Thus, to 
transfer digital R’G’B’ data requires three 
TMDS signals that comprise one TMDS link. 
Cable lengths may be up to 5 meters. 


TMDS 
TRANSMITTER 


TMDS 
LINK 


TMDS Links 

A single TMDS link, as shown in Figure 6.47, 
supports formats and timings requiring a clock 
rate of 22.5-160 MHz. 


Video Data Formats 
24-bit R’G’B’ data is transferred over the link, 
as shown in Figure 6.47. 


Control Signals 
In addition to the video data, there are eight 
control signals: 


HSYNC horizontal syne 
VSYNC vertical sync 
DE data enable 
CTLO-CTL3 reserved 

CLK 1x sample clock 


While DE is a “1,” active video is pro- 
cessed. While DE is a “0,” the HSYNC, VSYNC 
and CTLO-CTL3 signals are _ processed. 
HSYNC and VSYNC may be either polarity. 


TMDS 
RECEIVER 


BO-B7 ——+-+| }B0-B7_, > BO-B7 
vsYNC ——+-»| ENCODER CHANNEL 0 RECEIVER |WSYNC , | > VSYNC 
AND AND HSYNC 
HSYNC_ ——+—) SERIALIZER DECODER Feel > HSYNC 
DE ——> yee | > DE 
G0-G7 >| [| GO-G7 | > G0-G7 
cto —{-[s| ENCODER CHANNEL 4 RECEIVER |CTLO eGo. 
AND AND Cmlet INTER 
CTL1 >| SERIALIZER DECODER [| CHANNEL > CTL1 
DE1 
‘ie [--—] ALIGNMENT 
RO-R7 =| seat | > RO-R7 
cTL2 —++-+| aoa CHANNEL 2 Besa Mea > CTL2 
CTL3 
CTL3 >| SERIALIZER DECODER [>| > CTL3 
Lz DE2 
CHANNEL GC 
CLK >| > CLK 


Figure 6.47. 


DFP TMDS Link. 
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Figure 6.48. DFP Connector. 


Pin Signal Pin Signal 
1 D1 11 D2 
2 Di1- 12 D2- 
3 shield 13 shield 
4 shield 14 shield 
5 CLK 15 DO 
6 CLK- 16 DO- 
7 ground 17 no connect 
8 +5V 18 Hot Plug Detect 
9 no connect 19 DDC SDA 
10 no connect 20 DDC SCL 


Table 6.35. DFP Connector Signal Assignments. 


Connector 


The 20-pin mini-D ribbon (MDR) connector 
contains 20 contacts arranged as two rows of 
ten contacts, as shown in Figure 6.48. Table 


6.35 lists the pin assignments. 
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Open LVDS Display Interface (OpenLDI) 


OpenLDI was developed for transferring 
uncompressed digital video from a computer 
to a digital flat panel display. It enhances the 
FPD-Link standard used to drive the displays 
of laptop computers, and adds support for 
VESA’s Plug and Display (P&D) standard, 
Extended Display Identification Data (EDID) 
standard, and Display Data Channel (DDC) 
standard. DDC and EDID enable automatic 
display detection and configuration. 

Unlike DVI and DFP OpenLDI uses low- 
voltage differential signaling (LVDS). Cable 
lengths may be up to 10 meters. 


LVDS Link 

The LVDS link, as shown in Figure 6.49, sup- 
ports formats and timings requiring a clock 
rate of 32.5-160 MHz. 

Eight serial data lines (AO-A7) and two 
sample clock lines (CLK1 and CLK2) are used. 
The number of serial data lines actually used is 
dependent on the pixel format, with the serial 
data rate being 7x the sample clock rate. The 
CLK2 signal is used in the dual pixel modes for 
backwards compatibility with FPD-Link receiv- 
ers. 


Video Data Formats 

18-bit single pixel, 24-bit single pixel, 18-bit 
dual pixel, or 24-bit dual pixel R’G’B’ data is 
transferred over the link. Table 6.36 illustrates 
the mapping between the pixel data bit number 
and the OpenLDI bit number. 

The 18-bit single pixel R’G’B’ format uses 
three 6-bit R’G’B’ values: RO-R5, GO-G5, and 
BO-B5. OpenLDI serial data lines AO-A2 are 
used to transfer the data. 

The 24-bit single pixel R’G’B’ format uses 
three 8-bit R’G’B’ values: RO-R7, GO-G7, and 
BO-B7. OpenLDI serial data lines AO-A3 are 
used to transfer the data. 


The 18-bit dual pixel R’G’B’ format repre- 
sents two pixels as three upper/lower pairs of 
6-bit R’G’B’ values: RUO-RU5, GU0-GU5, 
BUO-BU5, RLO-RL5, GLO-GL5, BLO-BL5. 
Each upper/lower pair represents two pixels. 
OpenLDI serial data lines AO-A2 and A4—A6 
are used to transfer the data. 

The 24-bit dual pixel R’G’B’ format repre- 
sents two pixels as three upper/lower pairs of 
8-bit R’G’B’ values: RUO-RU7, GU0-GU7, 
BUO-BU7, RLO-RL7, GLO-GL7, BLO-BL7. 
Each upper/lower pair represents two pixels. 
OpenLDI serial data lines AO-A7 are used to 
transfer the data. 


Control Signals 
In addition to the video data, there are seven 
control signals: 


HSYNC horizontal sync 
VSYNC vertical sync 
DE data enable 
CNTLE reserved 
CNTLF reserved 

CLK1 1x sample clock 
CLK2 1x sample clock 


During unbalanced operation, the DE, 
HSYNC, VSYNC, CNTLE, and CNTLF levels 
are sent as unencoded bits within the A2 and 
A6 bitstreams. 

During balanced operation (used to mini- 
mize short- and long-term DC bias), a DC Bal- 
ance bit is sent within each of the A0-A7 
bitstreams to indicate whether the data is 
unmodified or inverted. Since there is no room 
left for the control signals to be sent directly, 
the DE level is sent by slightly modifying the 
timing of the falling edge of the CLK1 and 
CLK2 signals. The HSYNC, VSYNC, CNTLE 
and CNTLF levels are sent during the blanking 
intervals using 7-bit code words on the AO, A1, 
A5, and A4 signals, respectively. 
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LVDS LVDS LVDS 
TRANSMITTER LINK RECEIVER 
BO-B7 ——+->| > BO-B7 
VSYNC_ ———>4 CHANNEL O | | > VSYNC 
HSYNC_ ———>4 > HSYNC 
CHANNEL 1 
DE ——> >| K-+——-_ DE 
CHANNEL 2 || 
GO-G7 ——;—> Aas > G0-G7 
CNTLE |_,| ENCODER >| RECEIVER » CNTLE 
AND AND 
SERIALIZER CHANNEL 4 | J) DECODER 
CHANNEL 5 
| 
RO-R7. ———> CHANNEL 6 | | > RO-R7 
CNTLF_ ——>| > CNTLF 
CHANNEL 7 
| 
CLK1 > CLK1 
CLK2 > CLK2 
Figure 6.49. OpenLDI LVDS Link. 
18 Bits per Pixel 24 Bits per Pixel OpenLDI 
Bit Number Bit Number Bit Number 
5 7 5 
4 6 4 
3 5 3 
2 4 2 
1 3 a 
0 2 0 
1 7 
0 6 


Table 6.36. OpenLDI Bit Number Mappings. 
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Connector 

The 36-pin mini-D ribbon (MDR) connector is 
similar to the one shown in Figure 6.48, except 
that there are two rows of eighteen contacts. 
Table 6.37 lists the pin assignments. 


Gigabit Video Interface (GVIF) 


The Sony GVIF was developed for transferring 
uncompressed digital video using a single dif- 
ferential signal, instead of the multiple signals 
that DVI, DFP and OpenLDI use. Cable 
lengths may be up to 10 meters. 


GVIF Link 
The GVIF link, as shown in Figure 6.50, sup- 
ports formats and timings requiring a clock 
rate of 20-80 MHz. For applications requiring 
higher clock rates, more than one GVIF link 
may be used. 

The serial data rate is 24x the sample clock 
rate for 18-bit R’'G’B’ data, or 30x the sample 
clock rate for 24-bit R’G’B’ data. 


Video Data Formats 

18-bit or 24-bit R’G’B’ data, plus timing, is 
transferred over the link. The 18-bit R’G’B’ for- 
mat uses three 6-bit R’G’B’ values: RO-R5, GO- 
G5, and BO-B5. The 24-bit R’G’B’ format uses 
three 8-bit R’G’B’ values: RO-R7, GO-G7, and 
BO-B7. 


Pin Signal Pin Signal Pin Signal 
1 AO- 13 +5V 25 reserved 
2 Al- 14 A4- 26 reserved 
3 A2- 15 A5- 27 ground 
4 CLK1- 16 A6- 28 DDC SDA 
5 A3- 17 A7- 29 ground 
6 ground 18 CLK2- 30 USB- 
7 reserved 19 AO 31 ground 
8 reserved 20 Al 32 A4 
9 reserved 21 A2 33 AS 

10 DDC SCL 22 CLK1 34 A6 

11 +0V 23 A383 35 AZ 

12 USB 24 reserved 36 CLK2 


Table 6.37. OpenLDI Connector Signal Assignments. 
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18-bit R’G’B’ data is converted to 24-bit 
data by slicing the R’G’B data into six 3-bit val- 
ues that are in turn transformed into six 4-bit 
codes. This ensures rich transitions for 
receiver PLL locking and good DC balance. 

24-bit R’G’B’ data is converted to 30-bit 
data by slicing the R’G’B data into six 4-bit val- 
ues that are in turn transformed into six 5-bit 
codes. 


Control Signals 
In addition to the video data, there are six con- 
trol signals: 


HSYNC horizontal sync 
VSYNC vertical sync 
DE data enable 


CTLO reserved 
CTLI1 reserved 
CLK 1x sample clock 


If any of the HSYNC, VSYNC, DE, CTLO, 
or CTL1 signals change, during the next CLK 
cycle a special 30-bit format is used. The first 
six bits are header data indicating the new lev- 
els of HSYNC, VSYNC, DE, CTLO, or CTLI. 
This is followed by 24 bits of R’G’B data (unen- 
coded except for inverting the odd bits). 

Note that during the blanking periods, 
non-video data, such as digital audio, may be 
transferred. The CTL signals may be used to 
indicate when non-video data is present. 


GVIF 
TRANSMITTER 
BO-B7 ——~+— > 
VSYNC_ ——;—>} 
HSYNC_ ———~—* 
DE ——iaal 
GO-G7_ ——— 
CTLO ————> 
ENCODER 
AND 
SERIALIZER 
ROo-R7 —— jl 
cTL1 —— 
CLK ———> 


GVIF 
RECEIVER 
> BO-B7 
=> VSYNC 
> HSYNC 
> DE 
> GO0-G7 
> CTLO 
SDATA RECEIVER 
>| AND 
DECODER 
> RO-R7 
> CTL1 
> CLK 


Figure 6.50. GVIF Link. 
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Consumer Transport 
Interfaces 


IEEE 1394 


IEEE 1394 was originally developed by Apple 
Computer as Firewire. Designed to be a 
generic interface between devices, 1394 speci- 
fies the physical characteristics; separate appli- 
cation-specific specifications describe how to 
transfer data over the 1394 network. The SCTE 
DVS-194, EIA-775, and ITU-T J.117 specifica- 
tions for compatibility between digital televi- 
sions and settop boxes specifically include 
IEEE 1394 support. 

1394 is a transaction-based packet technol- 
ogy, using a bi-directional serial interconnect 
that features hot plug-and-play. This enables 
devices to be connected and disconnected 
without affecting the operation of other 
devices connected to the network. 

Guaranteed delivery of time-sensitive data 
is supported, enabling digital audio and video 
to be transferred in real time. In addition, mul- 
tiple independent streams of digital audio and 
video can be carried. 


Specifications 
The original 1394-1995 specification supports 
bit rates of 98.304, 196.608, and 393.216 Mbps. 
The proposed P1394-2000 (previously 
known as P1394a) specification clarifies areas 
that were vague and led to system interopera- 
bility issues. It also reduces the overhead lost 
to bus control, arbitration, bus reset duration, 
and concatenation of packets. P1394-2000 also 
introduces advanced power-savings features. 
The electrical signalling method is also com- 
mon between 1394-1995 and P1394-2000, using 
data-strobe (DS) encoding and analog-speed 
signaling. 


The proposed P1394b specification adds 
support for bit rates of 786.432, 1572.864, and 
3145.728 Mbps. It also includes the 8B/10B 
encoding technique used by Gigabit Ethernet, 
and changes the speed signalling to a more 
digital method. P1394b also supports new 
transport media in addition to copper cables, 
including plastic optical fiber (POF), glass opti- 
cal fiber (GOF), and Cat5 cable. With the new 
media come extended distances—up to 100 
meters using Cat5. 


Endian Issues 

1394 uses a big-endian architecture, defining 
the most significant bit as bit 0. However, many 
processors are based on the little endian archi- 
tecture which defines the most significant bit 
as bit 31 (assuming a 32-bit word). 


Network Topology 

Like many networks, there is no designated 
bus master. The tree-like network structure 
has a root node, branching out to logical nodes 
in other devices (Figure 6.51). The root is 
responsible for certain control functions, and 
is chosen during initialization. Once chosen, it 
retains that function for as long as it remains 
powered-on and connected to the network. 

A network can include up to 63 nodes, with 
each node (or device) specified by a 6-bit phys- 
ical identification number. Multiple networks 
may be connected by bridges, up to a system 
maximum of 1,023 networks, with each net- 
work represented by a separate 10-bit bus ID. 
Combined, the 16-bit address allows up to 
64,449 nodes in a system. Since device 
addresses are 64 bits, and 16 of these bits are 
used to specify nodes and networks, 48 bits 
remain for memory addresses, allowing up to 
256TB of memory space per node. 
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Figure 6.51. IEEE 1394 Network Topology Examples. 


Node Types 

Nodes on a 1394 bus may vary in complexity 
and capability (listed simplest to most com- 
plex): 

Transaction nodes respond to asynchro- 
nous communication, implement the minimal 
set of control status registers (CSR), and 
implement a minimal configuration ROM. 

Isochronous nodes add a 24.576 MHz clock 
used to increment a cycle timer register that is 
updated by cycle start packets. 

Cycle master nodes add the ability to gener- 
ate the 8 kHz cycle start event, generate cycle 
start packets, and implement a bus timer regis- 
ter. 

Isochronous resource manager (IRM) nodes 
add the ability to detect bad self-ID packets, 
determine the node ID of the chosen IRM, and 
implement the channels available, bandwidth 
available, and bus manager ID registers. At 
least one node must be capable of acting as an 
IRM to support isochronous communication. 

Bus manager (BM) nodes are the most 
complex. This level adds responsibility for 
storing every self-ID packet in a topology map 


and analyzing that map to produce a speed 
map of the entire bus. These two maps are 
used to manage the bus. Finally, the BM must 
be able to activate the cycle master node, write 
configuration packets to allow optimization of 
the bus, and act as the power manager. 


Node Ports 

In the network topology, a one-port device is 
known as a “leaf” device since it is at the end of 
a network branch. They can be connected to 
the network, but cannot expand the network. 

Two-port devices can be used to form 
daisy-chained topologies. They can be con- 
nected to and continue the network, as shown 
in Figure 6.51. Devices with three or more 
ports are able to branch the network to the full 
63-node capability. 

It is important to note that no loops or par- 
allel connections are allowed within the net- 
work. Also, there are no_ reserved 
connectors—any connector may be used to 
add a new device to the network. 

Since 1394-1995 mandates a maximum of 
16 cable “hops” between any two nodes, a max- 
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imum of 17 peripherals can be included in a 
network if only two-port peripherals are used. 
Later specifications implement a “ping” packet 
to measure the round-trip delay to any node, 
removing the 16 “hop” limitation. A 2- or 3-pair 
shielded cable is used, with one pair used for 
serial data, and one pair used for the data 
strobe signal. A third optional pair may be 
used to provide power to peripherals. 

Figure 6.52 illustrates the 1394-1995 and 
P1394-2000 data and strobe timing. The strobe 
signal changes state on every bit period for 
which the data signal does not. Therefore, by 
exclusive-ORing the data and strobe signals, 
the clock is recovered. 


Physical Layer 

The typical hardware topology of a 1394 net- 
work consists of a physical layer (PHY) and 
link layer (LINK), as shown in Figure 6.53. 
The 1394-1995 standard also defined two soft- 
ware layers, the transaction layer and the bus 
management layer, parts of which may be 
implemented in hardware. 

The PHY transforms the point-to-point net- 
work into a logical physical bus. Each node is 
also essentially a data repeater since data is 
reclocked at each node. The PHY also defines 
the electrical and mechanical connection to the 
network. Physical signaling circuits and logic 


DATA 


a a 


responsible for power-up initialization, arbitra- 
tion, bus-reset sensing, and data signaling are 
also included. 


Link Layer 

The LINK provides interfacing between the 
physical layer and application layer, formatting 
data into packets for transmission over the net- 
work. It supports both asynchronous and iso- 
chronous data. 


Asynchronous Data 

Asynchronous packets are guaranteed deliv- 
ery since after an asynchronous packet is 
received, the receiver transmits an acknowl- 
edgment to the sender, as shown in Figure 
6.54. However, there is no guaranteed band- 
width. This type of communication is useful for 
commands, non-real-time data, and error-free 
transfers. 

The delivery latency of asynchronous 
packets is not guaranteed and depends upon 
the network traffic. However, the sender may 
continually retry until an acknowledgment is 
received. 

Asynchronous packets are targeted to one 
node on the network or can be sent to all 
nodes, but can not be broadcast to a subset of 
nodes on the bus. 
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Figure 6.52. IEEE 1394 Data and Strobe Slgnal Timing. 
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The maximum asynchronous packet size 
is: 


512 * (n/ 100) bytes 
n = network speed in Mbps 


Isochronous Data 

Isochronous communications have a guaran- 
teed bandwidth, with up to 80% of the network 
bandwidth available for isochronous use. Up to 
63 independent isochronous channels are 
available, although the 1394 Open Host Con- 
troller Interface (OHCI]) currently only sup- 
ports 4-32 channels. This type of 
communication is useful for real-time audio 
and video transfers since the maximum deliv- 
ery latency of isochronous packets is calcula- 
ble and may be targeted to multiple 
destinations. However, the sender may not 
retry to send a packet. 


1394 LINK LAYER 
CONTROL (LLC) 


The maximum isochronous packet size is: 


1024 * (n / 100) bytes 
n = network speed in Mbps 


Isochronous operation guarantees a time 
slice each 125 ps. Since time slots are guaran- 
teed, and isochronous communication takes 
priority over asynchronous, isochronous band- 
width is assured. 

Once an isochronous channel is estab- 
lished, the sending device is guaranteed to 
have the requested amount of bus time for that 
channel every isochronous cycle. Only one 
device may send data on a particular channel, 
but any number of devices may receive data on 
a channel. A device may use multiple isochro- 
nous channels as long as capacity is available. 
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Figure 6.53. IEEE 1394 Typical Physical and Link Layer Block Diagrams. 
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Transaction Layer 

The transaction layer supports asynchronous 
write, read, and lock commands. A lock com- 
bines a write with a read by producing a round 
trip routing of data between the sender and 
receiver, including processing by the receiver. 


Bus Management Layer 

The bus management layer control functions 
of the network at the physical, link, and trans- 
action layers. 


Digital Transmission Content Protection 
(DTCP) 

To prevent unauthorized copying of content, 
the DTCP system was developed. Although 
originally designed for 1394, it is applicable to 
any digital network that supports bi-directional 
communications, such as USB. 

Device authentication, content encryp- 
tion, and renewability (should a device ever be 
compromised) are supported by DTCP. The 
Digital Transmission Licensing Administrator 
(DTLA) licenses the content protection system 


ISOCHRONOUS 
PACKETS 


and distributes cipher keys and device certifi- 
cates. 

DTCP outlines four elements of content 
protection: 


1. Copy control information (CCI) 

2. Authentication and key exchange (AKE) 
3. Content encryption 

4, System renewability 


Copy Control Information (CCI) 

CCI allows content owners to specify how their 
content can be used, such as “copy-never,” 
“copy-one-generation,” “no-more-copies,” and 
“copy-free.” DTCP is capable of securely com- 
municating copy control information between 
devices. Two different CCI mechanisms are 
supported: embedded and encryption mode indi- 
cator. 

Embedded CCI is carried within the con- 
tent stream. Tampering with the content 
stream results in incorrect decryption, main- 
taining the integrity of the embedded CCI. 
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Figure 6.54. IEEE 1394 Isochronous and Asynchronous Packets. 
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The encryption mode indicator (EMI) pro- 
vides a secure, yet easily accessible, transmis- 
sion of CCI by using the two most significant 
bits of the sync field of the isochronous packet 
header. Devices can immediately determine 
the CCI of the content stream without decod- 
ing the content. If the two EMI bits are tam- 
pered with, the encryption and decryption 
modes do not match, resulting in incorrect 
content decryption. 


Authentication and Key Exchange (AKE) 
Before sharing content, a device must first ver- 
ify that the other device is authentic. DTCP 
includes a choice of two authentication levels: 
full and restricted. Full authentication can be 
used with all content protected by the system. 
Restricted authentication enables the protec- 
tion of “copy-one-generation” and “no-more- 
copies” content only. 


Full authentication 

Compliant devices are assigned a unique pub- 
lic/private key pair and a device certificate by 
the DTLA, both stored within the device so as 
to prevent their disclosure. In addition, devices 
store other constants and keys necessary to 
implement the cryptographic protocols. 

Full authentication uses the public key- 
based digital signature standard (DSS) and 
Diffie-Hellman (DH) key exchange algo- 
rithms. DSS is a method for digitally signing 
and verifying the signatures of digital docu- 
ments to verify the integrity of the data. DH 
key exchange is used to establish control-chan- 
nel symmetric cipher keys, which allows two 
or more devices to generate a shared key. 

Initially, the receiver sends a request to the 
source to exchange device certificates and ran- 
dom challenges. Then, each device calculates a 
DH key exchange first-phase value. The 
devices then exchange signed messages that 
contain the following elements: 


1. The other device’s random challenge 

2. The DH key-exchange first-phase value 

3. The renewability message version num- 
ber of the newest system renewability 
message (SRM) stored by the device 


The devices check the message signatures 
using the other device’s public key to verify 
that the message has not been tampered with 
and also verify the integrity of the other 
device’s certificate. Each device also examines 
the certificate revocation list (CRL) embedded 
in its system renewability message (SRM) to 
verify that the other device’s certificate has not 
been revoked due to its security having been 
compromised. If no errors have occurred, the 
two devices have successfully authenticated 
each other and established an authorization 
key. 


Restricted authentication 

Restricted authentication may be used 
between sources and receivers for the 
exchange of “copy-one-generation” and “no- 
more-copies” contents. It relies on the use of a 
shared secret to respond to a random chal- 
lenge. 

The source initiates a request to the 
receiver, requests it’s device ID, and sends a 
random challenge. After receiving the random 
challenge back from the source, the receiver 
computes a response and sends it to the 
source. 

The source compares this response with 
similar information generated by the source 
using its service key and the ID of the receiver. 
If the comparison matches its own calculation, 
the receiver has been verified and authenti- 
cated. The source and receiver then each cal- 
culate an authorization key. 
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Content Encryption 

To ensure interoperability, all compliant 
devices must support the 56-bit M6 baseline 
cipher. Additional content protection may be 
supported by using additional, optional 
ciphers. 


System Renewability 

Devices that support full authentication can 
receive and process SRMs that are created by 
the DTLA and distributed with content. System 
renewability is used to ensure the long-term 
system integrity by revoking the device IDs of 
compromised devices. 

SRMs can be updated from other compli- 
ant devices that have a newer list, from media 
with prerecorded content, or via compliant 
devices with external communication capabil- 
ity (Internet, phone, cable, or network, etc.). 


Example Operation 

For this example, the source has been 
instructed to transmit a copy protected system 
stream of content. 

The source initiates the transmission of 
content marked with the copy protection sta- 
tus: “copy-one-generation,” “copy-never,” “no- 
more-copies,” or “copy-free.” 

Upon receiving the content stream, the 
receiver determines the copy protection status. 
If marked “copy never,” the receiver requests 
that the source initiate full authentication. If 
the content is marked “copy once” or “no more 
copies,” the receiver will request full authenti- 
cation if supported, or restricted authentica- 
tion if it isn’t. 

When the source receives the authentica- 
tion request, it proceeds with the requested 
type of authentication. If full authentication is 
requested but the source can only support 
restricted authentication, then restricted 
authentication is used. 


Once the devices have completed the 
authentication procedure, a content-channel 
encryption key (content key) is exchanged 
between them. This key is used to encrypt the 
content at the source device and decrypt the 
content at the receiver. 


1394 Open Host Controller Interface 
(OHCI) 

The 1394 Open Host Controller Interface 
(OHCI) specification is an implementation of 
the 1394 link layer, with additional features to 
support the transaction and bus management 
layers. It provides a standardized way of inter- 
acting with the 1394 network. 


Home AV Interoperability (HAVi) 

Home AV Interoperability (HAVi) is another 
layer of protocols for 1394. HAVi is directed at 
making 1394 devices plug-and-play interopera- 
ble in a 1394 network whether or not a PC host 
is present. 


Serial Bus Protocol (SBP-2) 

The ANSI Serial Bus Protocol 2 (SBP-2) 
defines standard way of delivering command 
and status packets over 1394 for devices such 
DVD players, printers, scanners, hard drives, 
and other devices. 


IEC 61883 Specifications 
Certain types of isochronous signals, such as 
MPEG 2 or IEC 61834 and SMPTE 314M digi- 
tal video (DV), use specific data transport pro- 
tocols and formats. When this data is sent 
isochronously over a 1394 network, special 
packetization techniques are used. 

The IEC 61883 series of specifications 
define the details for transferring various appli- 
cation-specific data over 1394: 
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IEC 61883-1 = General specification 


IEC 61883-2 = SD-DVCR data transmission 
25 Mbps continuous bit rate 


IEC 61883-3 = HD-DVCR data transmission 


IEC 61883-4 = MPEG2-TS data transmission 
bit rate bursts up to 44 Mbps 


IEC 61883-5 = SDL-DVCR data transmission 


IEC 61883-6 = Audio and music data transmission 


IEC 61883-1 

IEC 61883-1 defines the general structure for 
transferring digital audio and video data over 
1394. It describes the general packet format, 
data flow management, and connection man- 
agement for digital audio and video data, and 
also the general transmission rules for control 
commands. 


NORMAL 
ISOCHRONOUS 
PACKET 


PACKET HEADER 


HEADER CRC 


ISOCHRONOUS 
PACKET 
PAYLOAD 


DATA CRC 


32 BITS 


A common isochronous packet (CIP) 
header is placed at the beginning of the data 
field of isochronous data packets, as shown in 
Figure 6.55. It specifies the source node, data 
block size, data block count, time stamp, type 
of real-time data contained in the data field, etc. 

A connection management procedure 
(CMP) is also defined for making isochronous 
connections between devices. 

In addition, a functional control protocol 
(FCP) is defined for exchanging control com- 
mands over 1394 using asynchronous data. 


IEC 61883-2 

IEC 61883-2 defines the CIP header, data 
packet format, and transmission timing for IEC 
61834 and SMPTE 314M digital audio and 
compressed digital video (DV) data over 1394. 
Active resolutions of 720 x 480 (at 29.97 frames 
per second) and 720 x 576 (at 25 frames per 
second) are supported. 
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Figure 6.55. 61883-2 Isochronous Packet Formatting. 
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DV data packets are 488 bytes long, made 
up of 8 bytes of CIP header and 480 bytes of 
DV data, as shown in Figure 6.55. Figure 6.56 
illustrates the frame data structure. 

Each of the 720 x 480 4:1:1 YCbCr frames 
are compressed to 103,950 bytes, resulting in a 
4.9:1 compression ratio. Including overhead 
and audio increases the amount of data to 
120,000 bytes. 

The compressed 720 x 480 frame is 
divided into 10 DIF (data in frame) sequences. 
Each DIF sequence contains 150 DIF blocks of 
80 bytes each, used as follows: 


135 DIF blocks for video 
9 DIF blocks for audio 


6 DIF blocks used for Header, Subcode, and Video 
Auxiliary (VAUX) information 


Figure 6.57 illustrates the DIF sequence 
structure in detail. The audio DIF blocks con- 
tain both audio data and audio auxiliary data 
(AAUX). IEC 61834 supports four 32-kHz, 12- 
bit nonlinear audio signals or two 48-, 44.1-, or 
32-kHz, 16-bit audio signals. SMPTE 314M at 
25 Mbps supports two 48-kHz 16-bit audio sig- 
nals, while the 50 Mbps version supports four. 
Video auxiliary data (VAUX) DIF blocks 
include recording date and time, lens aperture, 
shutter speed, color balance, and other camera 
setting data. The subcode DIF blocks store a 
variety of information, the most important of 
which is timecode. 

Each video DIF block contains 80 bytes of 
compressed macroblock data: 


3 bytes for DIF block ID information 


1 byte for the header that includes the quantiza- 
tion number (QNO) and block status (STA) 


14 bytes each for YO, Y1, Y2, and Y3 
10 bytes each for Cb and Cr 


As the 488-byte packets come across the 
1394 network, the start of a video frame is 
determined. Once the start of a frame is 
detected, 250 valid packets of data are col- 
lected to have a complete DV frame; each 
packet contains 6 DIF blocks of data. Every 
15th packet is a null packet and should be dis- 
carded. Once 250 valid packets of data are in 
the buffer, discard the CIP headers. If all went 
well, you have a frame buffer with a 120,000 
byte compressed DV frame in it. 

720 x 576 frames may use either the 4:2:0 
YCbCr format (IEC 61834) or the 4:1:1 YCbCr 
format (SMPTE 314M), and require 12 DIF 
sequences. Each 720 x 576 frame is com- 
pressed to 124,740 bytes. Including overhead 
and audio increases the amount of data to 
144,000 bytes, requiring 300 packets to trans- 
fer. 

Note that the organization of data trans- 
ferred over 1394 differs from the actual DV 
recording format since error correction is not 
required for digital transmission. In addition, 
although the video blocks are numbered in 
sequence in Figure 6.57, the sequence does 
not correspond to the left-to-right, top-to-bot- 
tom transmission of blocks of video data. Com- 
pressed macroblocks are shuffled to minimize 
the effect of errors and aid in error conceal- 
ment. Audio data also is shuffled. Data is trans- 
mitted in the same shuffled order as recorded. 

To illustrate the video data shuffling, DV 
video frames are organized as 50 super blocks, 
with each super block being composed of 27 
compressed macroblocks, as shown in Figure 
6.58. A group of 5 super blocks (one from each 
super block column) make up one DIF 
sequence. Table 6.38 illustrates the transmis- 
sion order of the DIF blocks. Additional infor- 
mation on the DV data structure is available in 
Chapter 11. 
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1 FRAME IN 1.001/30 SECOND (10 DIF SEQUENCES) 


~<t La 
DIFSO DIFS1 DIFS2 DIFS3 DIFS4 DIFS5 DIFS6 DIFS7 DIFS8 DIFS9 
1 DIF SEQUENCE IN 1.001/300 SECOND (150 DIF BLOCKS) 
~<t La 
HEADER | SUBCODE VAUX 
(1 DIF) (2 DIF) (3. DIF) 135 VIDEO AND 9 AUDIO DIF BLOCKS 
150 DIF BLOCKS IN 1.001/30 SECOND 
= Laat 
DIFO DIF1 DIF2 DIF3 DIF4 DIF5 DIF6 DIF148 | DIF149 
1 DIF BLOCK IN 1.001 / 45000 SECOND 
~<t > 
ID HEADER DATA 
(3 BYTES) | (1 BYTE) (76 BYTES) 
tg? (14 BYTES) a (14 pMreS) gis (14 TES) Pad (14 BYTES) gees (10 BYTES). qe (10 BITES) 
COMPRESSED MACROBLOCK 
~<t > 


Figure 6.56. IEC 61834 and SMPTE 314M Packet Formatting for 720 x 480 Systems (4:1:1 


YCbCr). 
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H = HEADER SECTION 


H sco sci VAO VAI VA2 Sco, SC1 = SUBCODE SECTION 
VAO, VA1, VA2 = VAUX SECTION 
0 { 2 3 4 5 AO-A8 = AUDIO SECTION 
VoO-V134 = VIDEO SECTION 
ie AO vo Vv | v2 V3 V4 V5 V6 V7 v8 vg Vv10 vii vi2 | V13 vi4 
6 a 8 9 10 11 12 13 14 15 16 17 18 19 20 21 
ie Al V15 V16 | V17 V18 Vvi9 v20 v21 v22 V23 V24 V25 V26 V27 | V28 v29 
22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 
ie A8& Vv120 vi2i | V122 V123 v124 V125 V126 V127 V128 v129 V130 V131 V132 V133 V134 
134 135 136 137 138 139 140 144 142 143 144 145 146 147 148 149 


Figure 6.57. IEC 61834 and SMPTE 314M DIF Sequence Detail (25 Mbps). 
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480 
LINES 


720 SAMPLES 


~ > 
SUPERBLOCK 
0 1 2 3 4 
0) S0,0 S0,1 $0,2 S0,3 S0,4 
1 $1,0 S11 $1,2 $1,3 $1,4 
2 S$2,0 $2,1 $2,2 $2,3 $2,4 
3 $3,0 $3,1 $3,2 $3,3 $3,4 
4 $4,0 S4,1 $4,2 $4,3 $4,4 
5 $5,0 $5,1 $5,2 $5,3 $5,4 
6 $6,0 S6,1 $6,2 $6,3 S6,4 
7 $7,0 S7,1 $7,2 $7,3 $7,4 
8 S8,0 S8,1 $8,2 $8,3 S8,4 
9 $9,0 S9,1 $9,2 $9,3 $9,4 


MACROBLOCK 


Figure 6.58. Relationship Between Super Blocks and Macroblocks (720 x 480, 4:1:1 YCbCr). 
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Video Compressed Video Compressed 
DIF DIF Macroblock DIF DIF Macroblock 
Sequence Block Sequence Block 
Number oc Superblock | Macroblock Number oc Superblock | Macroblock 
Number Number Number Number Number Number 
0 2,2 0 
1 6,1 0 0 1,2 0 
2 8,3 0 1 5, 1 0 
a 3 0, 0 0 2 7,3 0 
4 4,4 0 3 n-1,0 0 
n-1 
: 4 3,4 0 
133 0, 0 26 : 
134 4,4 26 133 n-1,0 26 
0 3, 2 0 134 3,4 26 
1 (el 0 
2 9,3 0 
3 1,0 0 
1 
4 5, 4 0 
133 1,0 26 
134 5, 4 26 
Notes: 


1. n = 10 for 480-line systems, n = 12 for 576-line systems. 


Table 6.38. Video DIF Blocks and Compressed Macroblocks for 25 Mbps. 
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IEC 61883-4 

IEC 61883-4 defines the CIP header, data 
packet format, and transmission timing for 
MPEG 2 Transport Streams over 1394. 

It is most efficient to carry an integer num- 
ber of 192 bytes (188 bytes of MPEG data plus 
4 bytes of time stamp) per isochronous packet, 
as shown in Figure 6.59. However, MPEG data 
rates are rarely integer multiples of the isoch- 
ronous data rate. Thus, it is more efficient to 
divide the MPEG packets into smaller compo- 
nents of 24 bytes each to maximize available 
bandwidth. The transmitter then uses an inte- 
ger number of data blocks (restricted multi- 
ples of 0, 1, 2, 4, or 8, placing then in an 
isochronous packet and adding the 8-byte CIP 
header. 


NORMAL 
ISOCHRONOUS 
PACKET 


PACKET HEADER 


HEADER CRC 


ISOCHRONOUS 
PACKET 


PAYLOAD 


DATA CRC 


32 BITS 


Fe 


i 


Digital Camera Specification 
The 1394 Trade Association has written a spec- 
ification for 1394-based digital video cameras. 
This was done to avoid the silicon and software 
cost of implementing the full IEC 61883 specifi- 
cation. 

Seven resolutions are defined, with a wide 
range of format support: 


160 x 120 4:4:4 YCbCr 

320 x 240 4:2:2 YCbCr 

640 x 480 4:1:1, 4:2:2 YCbCr, 24-bit RGB 
800 x 600 4:2:2 YCbCr, 24-bit RGB 

1024 x 768 4:2:2 YCbCr, 24-bit RGB 

1280 x 960 4:2:2 YCbCr, 24-bit RGB 
1600x1200 = 4:2:2 YCbCr, 24-bit RGB 


Supported frame rates are 1.875, 3.75, 7.5, 15, 
30, and 60 frames per second. 

Isochronous packets are used to transfer 
the uncompressed digital video data over the 
1394 network. 


61883 - 4 
ISOCHRONOUS 
PACKET 


CIP HEADER 0 


CIP HEADER 1 


DATA 
PAYLOAD 
(192 BYTES) 


32 BITS 


Figure 6.59. 61883-4 Isochronous Packet Formatting. 
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=z 0 
Digital Video 
Processing 


In addition to encoding or decoding NTSC/ 
PAL or MPEG video, a typical system usually 
requires considerable additional video pro- 
cessing. 

Since most computer displays, and option- 
ally HDTV, are noninterlaced, interlaced video 
must be converted to noninterlaced (“deinter- 
laced”). Noninterlaced video must be con- 
verted to interlaced to drive a conventional 
analog VCR or TV, requiring noninterlaced-to- 
interlaced conversion. 

Many computer displays have a vertical 
refresh rate of about 75 Hz, whereas consumer 
video has a vertical refresh rate of 25 or 29.97 
(30/1.001) frames per second. For DVD and 
HDTV, source material may only be 24 frames 
per second. Thus, some form of frame rate 
conversion must be done. 
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Another not-so-subtle problem includes 
video scaling. SDTV and HDTV support mulTti- 
ple resolutions, yet the display may be a single, 
fixed resolution. 

Alpha mixing and chroma keying are used 
to mix multiple video signals or video with 
computer-generated text and graphics. Alpha 
mixing ensures a smooth crossover between 
sources, allows subpixel positioning of text, 
and limits source transition bandwidths to sim- 
plify eventual encoding to composite video sig- 
nals. 

Since no source is perfect, even digital 
sources, user controls for adjustable bright- 
ness, contrast, saturation, and hue are always 
desirable. 
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Rounding Considerations 


When two 8-bit values are multiplied together, 
a 16-bit result is generated. At some point, a 
result must be rounded to some lower preci- 
sion (for example, 16 bits to 8 bits or 32 bits to 
16 bits) in order to realize a cost-effective hard- 
ware implementation. There are several round- 
ing techniques: truncation, conventional 
rounding, error feedback rounding, and 
dynamic rounding. 


Truncation 


Truncation drops any fractional data during 
each rounding operation. As a result, after only 
a few operations, a significant error may be 
introduced. This may result in contours being 
visible in areas of solid colors. 


Conventional Rounding 


Conventional rounding uses the fractional data 
bits to determine whether to round up or 
round down. If the fractional data is 0.5 or 
greater, rounding up should be performed— 
positive numbers should be made more posi- 
tive and negative numbers should be made 
more negative. If the fractional data is less than 
0.5, rounding down should be performed— 


positive numbers should be made less positive 
and negative numbers should be made less 
negative. 


Error Feedback Rounding 


Error feedback rounding follows the principle 
of “never throw anything away.” This is accom- 
plished by storing the residue of a truncation 
and adding it to the next video sample. This 
approach substitutes less visible noise-like 
quantizing errors in place of contouring effects 
caused by simple truncation. An example of an 
error feedback rounding implementation is 
shown in Figure 7.1. In this example, 16 bits 
are reduced to 8 bits using error feedback. 


Dynamic Rounding 


This technique (a licensable Quantel patent) 
dithers the LSB according to the weighting of 
the discarded fractional bits. The original data 
word is divided into two parts, one represent- 
ing the resolution of the final output word and 
one dealing with the remaining fractional data. 
The fractional data is compared to the output 
of a random number generator equal in resolu- 
tion to the fractional data. The output of the 
comparator is a 1-bit random pattern weighted 
by the value of the fractional data, and serves 


REGISTER 


Figure 7.1. Error Feedback Rounding. 
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as a carry-in to the adder. In all instances, only 
one LSB of the output word is changed, in a 
random fashion. An example of a dynamic 
rounding implementation is shown in Figure 
7.2. 


SDTV - HDTV YCbCr 
Transforms 


SDTV and HDTV applications have different 
colorimetric characteristics, as discussed in 
Chapter 3. Thus, when SDTV (HDTV) data is 
displayed on a HDTV (SDTV) display, the 
YCbCr data should be processed to compen- 
sate for the different colorimetric characteris- 
tics. 


SDTV to HDTV 


A 3x3 matrix can be used to convert from 
Y¢6o9CbCr (SDTV) to Y799CbCr (HDTV): 


16 8 (MSB) 


1 -0.11554975 —0.20793764 
0 1.01863972 0.11461795 
0 0.07504945  1.02532707 


Note that before processing, the 8-bit DC 
offset (16 for Y and 128 for CbCr) must be 
removed, then added back in after processing. 


HDTV to SDTV 


A 3x3 matrix can be used to convert from 
Y799CbCr (HDTV) to Y¢6o9yCbCr (SDTV): 


1 0.09931166 0.19169955 
0 0.98985381 —0.11065251 
0 —0.07245296 0.98339782 


Note that before processing, the 8-bit DC off- 
set (16 for Y and 128 for CbCr) must be 
removed, then added back in after processing. 


8 (LSB) 


PSEUDO 
RANDOM 
BINARY 
SEQUENCE 


COMPARATOR 


GENERATOR 


Figure 7.2. Dynamic Rounding. 
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4:4:4 to 4:2:2 YCbCr 
Conversion 


Converting 4:4:4 YCbCr to 4:2:2 YCbCr (Fig- 
ure 7.3) is a common function in digital video. 
4:2:2 YCbCr is the basis for many digital video 
interfaces, and requires fewer connections to 
implement. 

Saturation logic should be included in the 
Y, Cb, and Cr data paths to limit the 8-bit range 
to 1-254. The 16 and 128 values shown in Fig- 
ure 7.3 are used to generate the proper levels 
during blanking intervals. 


Y Filtering 


A template for the Y lowpass filter is shown in 
Figure 7.4 and Table 7.1. 

Because there may be many cascaded con- 
versions (up to 10 were envisioned), the filters 
were designed to adhere to very tight toler- 


. OPTIONAL 
VA LPF 


én OPTIONAL 
V4 LPF 


128 —»); MUX 


OPTIONAL 
CB —S—- 
LPF 


ances to avoid a buildup of visual artifacts. 
Departure from flat amplitude and group delay 
response due to filtering is amplified through 
successive stages. For example, if filters exhib- 
iting -1 dB at 1 MHz and -3 dB at 1.3 MHz 
were employed, the overall response would be 
-8 dB (at 1 MHz) and —24 dB (at 1.3 MHz) 
after four conversion stages (assuming two fil- 
ters per stage). 

Although the sharp cut-off results in ring- 
ing on Y edges, the visual effect should be min- 
imal provided that group-delay performance is 
adequate. When cascading multiple filtering 
operations, the passband flatness and group- 
delay characteristics are very important. The 
passband tolerances, coupled with the sharp 
cut-off, make the template very difficult (some 
say impossible) to match. As a result, there 
usually is temptation to relax passband accu- 
racy, but the best approach is to reduce the 
rate of cut-off and keep the passband as flat as 
possible. 


MUX = YCBCR 


Figure 7.3. 4:4:4 to 4:2:2 YCbCr Conversion. 
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ATTENUATION 
(DB) 
50 DB 
50 


40 DB 
40 


20 

12 DB 
10 
0 


0.40 FS 0.50FS O0.60FS 0.73FS 


FREQUENCY (MHZ) 


Figure 7.4. Y Filter Template. F, = Y 1x sample rate. 


Frequency Range Typical SDTV Tolerances Typical HDTV Tolerances 


Passband Ripple Tolerance 


0 to 0.40F, +0.01 dB increasing to +0.05 dB +0.05 dB 
Passband Group Delay Tolerance 

0 to 0.27F, 0 increasing to +1.35 ns +0.075T 

0.27F, to 0.40F, +1.35 ns increasing to +2 ns +0.110T 


Table 7.1. Y Filter Ripple and Group Delay Tolerances. F , = Y 1x sample 
rate. T= 1 / F,. 
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ATTENUATION 
(DB) 


60 


55 DB 
50 


20 
10 

6 DB 
. ies, 


0.20 FS 0.30 FS 0.47 FS 
0.25 FS 


FREQUENCY (MHZ) 


Figure 7.5. Cb and Cr Filter Template for Digital Filter for Sample Rate 
Conversion from 4:4:4 to 4:2:2. F, = Y 1x sample rate. 


Frequency Range Typical SDTV Tolerances Typical HDTV Tolerances 


Passband Ripple Tolerance 


0 to 0.20F, 0 dB increasing to +0.05 dB +0.05 dB 


Passband Group Delay Tolerance 


0 to 0.20F, delay distortion is zero by design 


Table 7.2. CbCr Filter Ripple and Group Delay Tolerances. F, = Y 1x sample 
rate. T= 1 / Fg. 
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CbCr Filtering 


Cb and Cr are lowpass filtered and decimated. 
In a standard design, the lowpass and decima- 
tion filters may be combined into a single filter, 
and a single filter may be used for both Cb and 
Cr by multiplexing. 

As with Y filtering, the Cb and Cr lowpass 
filtering requires a sharp cut-off to prevent 
repeated conversions from producing a cumu- 
lative resolution loss. However, due to the low 
cut-off frequency, the sharp cut-off produces 
ringing that is more noticeable than for Y. 

A template for the Cb and Cr filters is 
shown in Figure 7.5 and Table 7.2. 

Since aliasing is less noticeable in color dif- 
ference signals, the attenuation at half the sam- 
pling frequency is only 6 dB. There is an 
advantage in using a skew-symmetric response 
passing through the -6 dB point at half the 
sampling frequency—this makes alternate 
coefficients in the digital filter zero, almost 
halving the number of taps, and also allows 
using a single digital filter for both the Cb and 
Cr signals. Use of a transversal digital filter has 
the advantage of providing perfect linear phase 
response, eliminating the need for group-delay 
correction. 

As with the Y filter, the passband flatness 
and group-delay characteristics are very 
important, and the best approach again is to 
reduce the rate of cut-off and keep the pass- 
band as flat as possible. 


Display Enhancement 


Hue, Contrast, Brightness, and 
Saturation 


Working in the YCbCr color space has the 
advantage of simplifying the adjustment of con- 
trast, brightness, hue, and saturation, as shown 


in Figure 7.6. Also illustrated are multiplexers 
to allow the output of black screen (R’, G’, B’, = 
0, 0, 0), blue screen (R’, G’, B’ = 73, 121, 245), 
and color bars. 

The design should ensure that no overflow 
or underflow wrap-around errors occur; effec- 
tively saturating results to the 0 and 255 values. 


Y Processing 

16 is subtracted from the Y data to position the 
black level at zero. This removes the DC offset 
so adjusting the contrast does not vary the 
black level. Since the Y input data may have 
values below 16, negative Y values should be 
supported at this point. 

The contrast is adjusted by multiplying the 
YCbCr data by a constant. If Cb and Cr are not 
adjusted, a color shift will result whenever the 
contrast is changed. A typical 8-bit contrast 
adjustment range is 0-1.992x. 

The brightness control data is added or 
subtracted from the Y data. Brightness control 
is done after the contrast control to avoid intro- 
ducing a varying DC offset due to adjusting 
the contrast. A typical 6-bit brightness adjust- 
ment range is —32 to +31. 

Finally, 16 is added to position the black 
level at 16. 


CbCr Processing 
128 is subtracted from Cb and Cr to position 
the range about zero. 

The hue control is implemented by mixing 
the Cb and Cr data: 


Cb’ = Cb cos 6+ Cr sin 0 
Cr’= Cr cos 0 — Cb sin 0 


where 60 is the desired hue angle. A typical 8-bit 

hue adjustment range is -30° to +30°. 
Saturation is adjusted by multiplying both 

Cb and Cr by a constant. A typical 8-bit satura- 
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Figure 7.6. Hue, Saturation, Contrast, and Brightness Controls. 
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tion adjustment range is 0-1.992x. In the exam- 
ple shown in Figure 7.6, the contrast and 
saturation values are multiplied together to 
reduce the number of multipliers in the CbCr 
datapath. 

Finally, 128 is added to both Cb and Cr. 


Color Transient Improvement 


YCbCr transitions are normally aligned. How- 
ever, the Cb and Cr transitions are usually 
degraded due to the narrow bandwidth of 
color difference information. 

By monitoring coincident Y transitions, 
faster transitions may be synthesized for Cb 
and Cr. These edges are then aligned with the 
Y edge, as shown in Figure 7.7. 

Alternately, Cb and Cr transitions may be 
differentiated, and the results added to the 
original Cb and Cr signals. Small amplitudes in 
the differentiation signals should be sup- 
pressed by coring. To eliminate “wrong colors” 
due to overshoots and undershoots, the 
enhanced CbCr signals should also be limited 
to the proper range. 


——> 150 NS 


CB, CR 


ENHANCED 
CB, CR 


——= 150 NS 


Figure 7.7. Color Transient Improvement. 


Since this technique artificially increases 
the high-frequency component of video sig- 
nals, it should not be used if the video will be 
compressed, as the compression ratio will be 
reduced. 


Sharpness 


The apparent sharpness of a picture may be 
increased by increasing the amplitude of high- 
frequency luminance information. 

As shown in Figure 7.8, a simple bandpass 
filter with selectable gain (also called a peaking 
filter) may be used. The frequency where max- 
imum gain occurs is usually selectable to be 
either at the color subcarrier frequency or at 
about 2.6 MHz. A coring circuit is typically 
used after the filter to reduce low-level noise. 

Figure 7.9 illustrates a more complex 
sharpness control circuit. The high-frequency 
luminance is increased using a variable band- 
pass filter, with adjustable gain. The coring 
function (typically +1 LSB) removes low-level 
noise. The modified luminance is then added 
to the original luminance signal. 

Since this technique artificially increases 
the high-frequency component of the video sig- 
nals, it should not be used if the video will be 
compressed, as the compression ratio will be 
reduced. 

In addition to selectable gain, selectable 
attenuation of high frequencies should also be 
supported. Many televisions boost high-fre- 
quency gain to improve the apparent sharp- 
ness of the picture. If this is applied to a MPEG 
2 source, the picture quality is substantially 
degraded. Although the sharpness control on 
the television may be turned down, this affects 
the picture quality of analog broadcasts. There- 
fore, many MPEG 2 sources have the option of 
attenuating high frequencies to negate the 
sharpness control on the television. 
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Figure 7.8. Simple Adjustable Sharpness Control. (a) NTSC. (b) PAL. 
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Figure 7.9. More Complex Sharpness Control. (a) Typical implementation. (b) Coring function. 


196 = Chapter 7: Digital Video Processing 


Video Mixing and Graphics 
Overlay 


Mixing video signals may be as simple as 
switching between two video sources. This is 
adequate if the resulting video is to be dis- 
played on a computer monitor. 

For most other applications, a technique 
known as alpha mixing should be used. Alpha 
mixing may also be used to fade to or from a 
specific color (such as black) or to overlay 
computer-generated text and graphics onto a 
video signal. 

Alpha mixing must be used if the video is 
to be encoded to composite video. Otherwise, 
ringing and blurring may appear at the source 
switching points, such as around the edges of 
computer-generated text and graphics. This is 
due to the color information being lowpass fil- 
tered within the NTSC/PAL encoder. If the fil- 
ters have a sharp cut-off, a fast color transition 
will produce ringing. In addition, the intensity 
information may be bandwidth-limited to about 
4-5 MHz somewhere along the video path, 
slowing down intensity transitions. 

Mathematically, with alpha normalized to 
have values of 0-1, alpha mixing is imple- 
mented as: 


out = (alpha_0) (in_0) + (alpha_1) (in_1) +... 


In this instance, each video source has its own 
alpha information. The alpha information may 
not total to one (unity gain). 

Figure 7.10 shows mixing of two YCbCr 
video signals, each with its own alpha informa- 
tion. As YCbCr uses an offset binary notation, 
the offset (16 for Y and 128 for Cb and Cr) is 
removed prior to mixing the video signals. 
After mixing, the offset is added back in. Note 
that two 4:2:2 YCbCr streams may also be pro- 
cessed directly; there is no need to convert 


them to 4:4:4 YCbCr, mix, then convert the 
result back to 4:2:2 YCbCr. 

When only two video sources are mixed 
and alpha_0 + alpha_1 = 1 (implementing a 
crossfader), a single alpha value may be used, 
mathematically shown as: 


out = (alpha) (in_0) + (1 — alpha) (in_1) 


When alpha = 0, the output is equal to the in_1 
video signal; when alpha = 1, the output is 
equal to the in_0 video signal. When alpha is 
between 0 and 1, the two video signals are pro- 
portionally multiplied, and added together. 

Expanding and rearranging the previous 
equation shows how a two-channel mixer may 
be implemented using a single multiplier: 


out = (alpha) (in_0 —in_1) + in_1 


Fading to and from a specific color is done by 
setting one of the input sources to a constant 
color. 

Figure 7.11 illustrates mixing two YCbCr 
sources using a single alpha channel. Figures 
7.12 and 7.13 illustrate mixing two R’G’B’ 
video sources (R’G’B’ has a range of 0-255). 
Figures 7.14 and 7.15 show mixing two digital 
composite video signals. 

A common problem in computer graphics 
systems that use alpha is that the frame buffer 
may contain preprocessed R’G’B’ or YCbCr 
data; that is, the R’G’B’ or YCbCr data in the 
frame buffer has already been multiplied by 
alpha. Assuming an alpha value of 0.5, non- 
processed R’G’B’A values for white are (255, 
255, 255, 128); preprocessed R’G’B’A values 
for white are (128, 128, 128, 128). Therefore, 
any mixing circuit that accepts R’G’B’ or 
YCbCr data from a frame buffer should be able 
to handle either format. 

By adjusting the alpha values, slow to fast 
crossfades are possible, as shown in Figure 
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Figure 7.10. Mixing Two YCbCr Video Signals, Each With Its Own Alpha Channel. 
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Figure 7.11. Simplified Mixing (Crossfading) of Two YCbCr Video 
Signals Using a Single Alpha Channel. 
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Figure 7.12. Mixing Two RGB Video Signals (RGB has a 
Range of 0-255), Each With Its Own Alpha Channel. 
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Figure 7.13. Simplified Mixing (Crossfading) of Two RGB Video 
Signals (RGB has a Range of 0-255) Using a Single Alpha Channel. 
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Figure 7.14. Mixing Two Digital Composite Video Signals, Each With Its Own Alpha Channel. 
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Figure 7.15. Simplified Mixing (Crossfading) of Two Digital Composite 
Video Signals Using a Single Alpha Channel. 
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Figure 7.16. Controlling Alpha Values to Implement (a) Fast or (b) 
Slow Keying. In (a), the effective switching point lies between two 
samples. In (b), the transition is wider and is aligned at a sample 
instant. 


Luma and Chroma Keying 203 


7.16. Large differences in alpha between sam- 
ples result in a fast crossfade; smaller differ- 
ences result in a slow crossfade. If using alpha 
mixing for special effects, such as wipes, the 
switching point (where 50% of each video 
source is used) must be able to be adjusted to 
an accuracy of less than one sample to ensure 
smooth movement. By controlling the alpha 
values, the switching point can be effectively 
positioned anywhere, as shown in Figure 
7.16a. 

Text can be overlaid onto video by having a 
character generator control the alpha inputs. 
By setting one of the input sources to a con- 
stant color, the text will assume that color. 

Note that for those designs that subtract 16 
(the black level) from the Y channel before 
processing, negative Y values should be sup- 
ported after the subtraction. This allows the 
design to pass through real-world and test 
video signals with minimum artifacts. 


Luma and Chroma Keying 


Keying involves specifying a desired fore- 
ground color; areas containing this color are 
replaced with a background image. Alter- 
nately, an area of any size or shape may be 
specified; foreground areas inside (or outside) 
this area are replaced with a background 
image. 


Luminance Keying 


Luminance keying involves specifying a 
desired foreground luminance level; fore- 
ground areas containing luminance levels 
above (or below) the keying level are replaced 
with the background image. 

Alternately, this hard keying implementa- 
tion may be replaced with soft keying by speci- 


fying two luminance values of the foreground 
image: Yy and Yj, (Y,, < Yq). For keying the 
background into “white” foreground areas, 
foreground luminance values (Ypcg) above Yy 
are replaced with the background image; Yrc 
values below Y; contain the foreground image. 
For Yxc values between Y; and Yq, linear mix- 
ing is done between the foreground and back- 
ground images. This operation may be 
expressed as: 


if Ypg > Yy 
K = 1 = background only 


if YFG < YL 
K = 0 = foreground only 


if Yy => Ypo 2 Yz, 
K= (Yeo - Yp)/Wy- Yp) = mix 


By subtracting K from 1, the new lumi- 
nance keying signal for keying into “black” 
foreground areas can be generated. 

Figure 7.17 illustrates luminance keying 
for two YCbCr sources. Although chroma key- 
ing typically uses a suppression technique to 
remove information from the foreground 
image, this is not done when luminance keying 
as the magnitudes of Cb and Cr are usually not 
related to the luminance level. 

Figure 7.18 illustrates luminance keying 
for R’G’B’ sources, which is more applicable 
for computer graphics. Ypg may be obtained 
by the equation: 


Yr = 0.299R’ + 0.587G’ + 0.114B’ 


In some applications, the red and blue data is 
ignored, resulting in Ypg being equal to only 
the green data. 

Figure 7.19 illustrates one technique of 
luminance keying between two digital compos- 
ite video sources. 
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Figure 7.17. Luminance Keying of Two YCbCr Video Signals. 
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Figure 7.18. Luminance Keying of Two RGB Video Signals. RGB range is 0-255. 


206 = Chapter 7: Digital Video Processing 


LUMINANCE 
KEY 
GENERATOR 


Y/C 
SEPARATOR 


K 


BACKGROUND 
VIDEO 


FOREGROUND + 


eS = AND 
LIMITING 


ROUNDING 


= OUT 


1-K 
-) 


VIDEO 


MIXER 


Figure 7.19. Luminance Keying of Two Digital Composite Video Signals. 


Chroma Keying 


Chroma keying involves specifying a desired 
foreground key color; foreground areas con- 
taining the key color are replaced with the 
background image. Cb and Cr are used to 
specify the key color; luminance information 
may be used to increase the realism of the 
chroma keying function. The actual mixing of 
the two video sources may be done in the com- 
ponent or composite domain, although compo- 
nent mixing reduces artifacts. 

Early chroma keying circuits simply per- 
formed a hard or soft switch between the fore- 
ground and background sources. In addition to 
limiting the amount of fine detail maintained in 
the foreground image, the background was not 
visible through transparent or translucent fore- 


ground objects, and shadows from the fore- 
ground were not present in areas containing 
the background image. 

Linear keyers were developed that com- 
bine the foreground and background images 
in a proportion determined by the key level, 
resulting in the foreground image being atten- 
uated in areas containing the background 
image. Although allowing foreground objects 
to appear transparent, there is a limit on the 
fineness of detail maintained in the fore- 
ground. Shadows from the foreground are not 
present in areas containing the background 
image unless additional processing is done— 
the luminance levels of specific areas of the 
background image must be reduced to create 
the effect of shadows cast by foreground 
objects. 
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If the blue or green backing used with the 
foreground scene is evenly lit except for shad- 
ows cast by the foreground objects, the effect 
on the background will be that of shadows cast 
by the foreground objects. This process, 
referred to as shadow chroma keying, or lumi- 
nance modulation, enables the background 
luminance levels to be adjusted in proportion 
to the brightness of the blue or green backing 
in the foreground scene. This results in more 
realistic keying of transparent or translucent 
foreground objects by preserving the spectral 
highlights. 

Note that green backgrounds are now 
more commonly used due to lower chroma 
noise. 

Chroma keyers are also limited in their 
ability to handle foreground colors that are 
close to the key color without switching to the 


background image. Another problem may bea 
bluish tint to the foreground objects as a result 
of blue light reflecting off the blue backing or 
being diffused in the camera lens. Chroma 
spill is difficult to remove since the spill color 
is not the original key color; some mixing 
occurs, changing the original key color 
slightly. 

One solution to many of the chroma key- 
ing problems is to process the foreground and 
background images individually before com- 
bining them, as shown in Figure 7.20. Rather 
than choosing between the foreground and 
background, each is processed individually 
and then combined. Figure 7.21 illustrates the 
major processing steps for both the fore- 
ground and background images during the 
chroma key process. Not shown in Figure 7.20 
is the circuitry to initially subtract 16 (Y) or 
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Figure 7.20. Typical Component Chroma Key Circuit. 
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Figure 7.21. Major Processing Steps During Chroma Keying. (a) 
Original foreground scene. (b) Original background scene. (c) 
Suppressed foreground scene. (d) Background keying signal. (e) 
Background scene after multiplication by background key. (f) 
Composite scene generated by adding (c) and (e). 
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128 (Cb and Cr) from the foreground and 
background video signals and the addition of 
16 (Y) or 128 (Cb and Cr) after the final output 
adder. Any DC offset not removed will be 
amplified or attenuated by the foreground and 
background gain factors, shifting the black 
level. 

The foreground key (Kgg) and_back- 
ground key (Kg) signals have a range of 0 to 
1. The garbage matte key signal (the term 
matte comes from the film industry) forces the 
mixer to output the foreground source in one 
of two ways. 

The first method is to reduce Kgpg in pro- 
portion to increasing Kpg. This provides the 
advantage of minimizing black edges around 
the inserted foreground. 

The second method is to force the back- 
ground to black for all nonzero values of the 
matte key, and insert the foreground into the 
background “hole.” This requires a cleanup 
function to remove noise around the black 
level, as this noise affects the background pic- 
ture due to the straight addition process. 

The garbage matte is added to the fore- 
ground key signal (Kgg) using a non-additive 
mixer (NAM). A nonadditive mixer takes the 
brighter of the two pictures, on a sample-by- 
sample basis, to generate the key signal. Mat- 
ting is ideal for any source that generates its 
own keying signal, such as character genera- 
tors, and so on. 

The key generator monitors the fore- 
ground Cb and Cr data, generating the fore- 
ground keying signal, Kpcg. A desired key color 
is selected, as shown in Figure 7.22. The fore- 
ground Cb and Cr data are normalized (gener- 
ating Cb’ and Cr’) and rotated @ degrees to 
generate the X and Z data, such that the posi- 
tive X axis passes as close as possible to the 
desired key color. Typically, 9 may be varied in 
1° increments, and optimum chroma keying 


occurs when the X axis passes through the key 
color. 

X and Z are derived from Cb and Cr using 
the equations: 


X=Cb’ cos 6+ Cr’ sin 6 
Z = Cr’ cos 0 - Cb’ sin 8 


Since Cb’ and Cr’ are normalized to have a 
range of +1, X and Z have a range of +1. 

The foreground keying signal (Kpc) is 
generated from X and Z and has a range of 0-1: 


Kpg = X- (\Z|/ (tan (a/2))) 
Kgg = 0ifX < (|Z|/(tan (./2))) 


where « is the acceptance angle, symmetri- 
cally centered about the positive X axis, as 
shown in Figure 7.23. Outside the acceptance 
angle, Kpg is always set to zero. Inside the 
acceptance angle, the magnitude of Kgg lin- 
early increases the closer the foreground color 
approaches the key color and as its saturation 
increases. Colors inside the acceptance angle 
are further processed by the foreground sup- 
pressor. 

The foreground suppressor reduces fore- 
ground color information by implementing X = 
X - Kgg, with the key color being clamped to 
the black level. To avoid processing Cb and Cr 
when Kgg = 0, the foreground suppressor per- 
forms the operations: 


Cbrc¢ =Cb- Kroc cos 8 
Crpc¢ =Cr- Kroc sin 0 


where Cbyg and Crgg are the foreground Cb 
and Cr values after key color suppression. 
Early implementations suppressed  fore- 
ground information by multiplying Cb and Cr 
by a clipped version of the Kyg signal. This, 
however, generated in-band alias components 
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Figure 7.22. Rotating the Normalized Cb and Cr (Cb and Cr’) Axes by 6 to Obtain the X and Z 
Axes, Such That the X Axis Passes Through the Desired Key Color (Blue in This Example). 
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Figure 7.23. Foreground Key Values and Acceptance Angle. 
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due to the multiplication and clipping process 
and produced a hard edge at key color bound- 
aries. 

Unless additional processing is done, the 
Cbrg and Crpg components are set to zero 
only if they are exactly on the X axis. Hue vari- 
ations due to noise or lighting will result in 
areas of the foreground not being entirely sup- 
pressed. Therefore, a suppression angle is set, 
symmetrically centered about the positive X 
axis. The suppression angle (f) is typically con- 
figurable from a minimum of zero degrees, to a 
maximum of about one-third the acceptance 
angle (a). Any CbCr components that fall 
within this suppression angle are set to zero. 
Figure 7.24 illustrates the use of the suppres- 
sion angle. 

Foreground luminance, after being nor- 
malized to have a range of 0-1, is suppressed 
by: 


Ypo = Y’-ysKpc 
Yo = Oif ysKkrc >yY’ 


Here, ys is a programmable value and used to 
adjust Ypc so that it is clipped at the black level 
in the key color areas. 

The foreground suppressor also removes 
key-color fringes on wanted foreground areas 
caused by chroma spill, the overspill of the key 
color, by removing discolorations of the 
wanted foreground objects. 

Ultimatte® improves on this process by 
measuring the difference between the blue 
and green colors, as the blue backing is never 
pure blue and there may be high levels of blue 
in the foreground objects. Pure blue is rarely 
found in nature, and most natural blues have a 
higher content of green than red. For this rea- 
son, the red, green, and blue levels are moni- 
tored to differentiate between the blue backing 
and blue in wanted foreground objects. 


If the difference between blue and green is 
great enough, all three colors are set to zero to 
produce black; this is what happens in areas of 
the foreground containing the blue backing. 

If the difference between blue and green is 
not large, the blue is set to the green level 
unless the green exceeds red. This technique 
allows the removal of the bluish tint caused by 
the blue backing while being able to reproduce 
natural blues in the foreground. As an exam- 
ple, a white foreground area normally would 
consist of equal levels of red, green, and blue. 
If the white area is affected by the key color 
(blue in this instance), it will have a bluish 
tint—the blue levels will be greater than the 
red or green levels. Since the green does not 
exceed the red, the blue level is made equal to 
the green, removing the bluish tint. 

There is a price to pay, however. Magenta 
in the foreground is changed to red. A green 
backing can be used, but in this case, yellow in 
the foreground is modified. Usually, the clamp- 
ing is released gradually to increase the blue 
content of magenta areas. 

The key processor generates the initial 
background key signal (K’pg) used to remove 
areas of the background image where the fore- 
ground is to be visible. K’pg is adjusted to be 
zero in desired foreground areas and unity in 
background areas with no attenuation. It is 
generated from the foreground key signal 
(Kpcg) by applying lift (kj) and gain (kc) 
adjustments followed by clipping at zero and 
unity values: 


K’pc = (Kre - ky) Ke 


Figure 7.25 illustrates the operation of the 
background key signal generation. The transi- 
tion between K’pc = 0 and K’gg¢ = 1 should be 
made as wide as possible to minimize disconti- 
nuities in the transitions between foreground 
and background areas. 
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Figure 7.24. Suppression Angle Operation for a Gradual Change from a Red Foreground Object 
to the Blue Key Color. (a) Simple suppression. (b) Improved suppression using a suppression 
angle. 
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Figure 7.25. Background Key Generation. 


For foreground areas containing the same 
CbCr values, but different luminance (Y) val- 
ues, as the key color, the key processor may 
also reduce the background key value as the 
foreground luminance level increases, allow- 
ing turning off the background in foreground 
areas containing a “lighter” key color, such as 
light blue. This is done by: 


Kpo = K’pG - YcY FG 
Kgc = 9 if YeYpG > Kee 


To handle shadows cast by foreground 
objects, and opaque or translucent foreground 
objects, the luminance levels of the blue back- 
ing of the foreground image is monitored. 
Where the luminance of the blue backing is 


reduced, the luminance of the background 
image also is reduced. The amount of back- 
ground luminance reduction must be con- 
trolled so that defects in the blue backing 
(such as seams or footprints) are not inter- 
preted as foreground shadows. 

Additional controls may be implemented to 
enable the foreground and background signals 
to be controlled independently. Examples are 
adjusting the contrast of the foreground so it 
matches the background or fading the fore- 
ground in various ways (such as fading to the 
background to make a foreground object van- 
ish or fading to black to generate a silhouette). 

In the computer environment, there may 
be relatively slow, smooth edges—especially 
edges involving smooth shading. As smooth 
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edges are easily distorted during the chroma 
keying process, a wide keying process is usu- 
ally used in these circumstances. During wide 
keying, the keying signal starts before the 
edge of the graphic object. 


Composite Chroma Keying 
In some instances, the component signals 
(such as YCbCr) are not directly available. For 
these situations, composite chroma keying 
may be implemented, as shown in Figure 7.26. 
To detect the chroma key color, the fore- 
ground video source must be decoded to pro- 
duce the Cb and Cr color difference signals. 
The keying signal, Kpc, is then used to mix 
between the two composite video sources. The 


CB, CR 


garbage matte key signal forces the mixer to 
output the background source by reducing 
Kgc. 

Chroma keying using composite video sig- 
nals usually results in unrealistic keying, since 
there is inadequate color bandwidth. As a 
result, there is a lack of fine detail, and halos 
may be present on edges. 


Superblack Keying 

Video editing systems also may make use of 
superblack keying. In this application, areas of 
the foreground composite video signal that 
have a level of 0 to -5 IRE are replaced with the 
background video information. 


DECODE 


GENERATOR 


KEY GARBAGE 
MATTE 


BACKGROUND 


OUTPUT 


VIDEO 


FOREGROUND + 


~ 
VIDEO 


VIDEO 


Figure 7.26. Typical Composite Chroma Key Circuit. 
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Video Scaling 


With today’s graphical user interfaces (GUIs), 
many computer users want to display video in 
a window. To fit within the window, the video 
source may need to be scaled up or down. 
Many also want to output video for displaying 
on a TV. This may require the contents of the 
video window, or the entire screen, to be 
scaled to another resolution. 

With all the various SDTV and HDTV reso- 
lutions, scaling may also be needed to inter- 
face to a fixed-resolution display or other 
device. However, it is not efficient to first 
decode full HDTV resolution and then scale 
down to SDTV resolution if you never intend to 
use the HDTV signal. The trick is to use down- 
conversion and compression inside the MPEG 
decoder loop. This saves up to 70% of the mem- 
ory, and reduces memory bandwidth. 

When generating objects that will be dis- 
played on SDTV, the computer user must be 
concerned with such things as text size, line 
thickness, and so forth. For example, text 


readable on a 1280 x 1024 computer display 
may not be readable on a SDTV display due to 
the large amount of downscaling involved. 
Thin horizontal lines may either disappear 
completely or flicker at a 25- or 29.97-Hz rate 
when converted to interlaced SDTV. 

Table 7.3 lists some of the common com- 
puter and consumer video resolutions. Note 
that scaling must be performed on component 
video signals (such as R’G’B’ or YCbCr). Com- 
posite color video signals cannot be scaled 
directly due to the color subcarrier phase 
information present, which would be meaning- 
less after scaling. 

For interlaced video systems, field-based 
vertical processing must be done. A MPEG 2 
decoder can determine whether to use field- or 
frame-based vertical processing by looking at 
the progressive_frame flag of the MPEG 2 
video stream. 

The spacing between output samples can 
be defined by a Target Increment (tarinc) 
value: 


Computer Consumer SDTV Consumer HDTV 
640 x 480 720 x 360! 720 x 432! 1280 x 720 
854 x 480 352 x 480 352 x 576 1280 x 1080 
800 x 600 480 x 480 480 x 576 1440 x 1080 
1024 x 768 528 x 480 544 x 576 1920 x 1080 
1280 x 768 544 x 480 720 x 576 
1280 x 1024 640 x 480 768 x 576 
1600 x 1200 720 x 480 960 x 576 
960 x 480 


Table 7.3. Common Active Resolutions for Computer Displays and 
Consumer Video. 116:9 letterbox on a 4:3 SDTV display. 
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tarinc =I / O 


where I and O are the number of input (I) and 
output (O) samples, either horizontally or ver- 
tically. 

The first and last output samples may be 
aligned with the first and last input samples by 
adjusting the equation to be: 


tarinc = (I- 1) / (O- 1) 


Pixel Dropping and Duplication 


This is also called “nearest neighbor” scaling 
since only the input sample closest to the out- 
put sample is used. 

The simplest form of scaling down is pixel 
dropping, where (m) out of every (n) samples 
are thrown away both horizontally and verti- 
cally. A modified version of the Bresenham 
line-drawing algorithm (described in most 
computer graphics books) is typically used to 
determine which samples not to discard. 

Simple upscaling can be accomplished by 
pixel duplication, where (m) out of every (n) 
samples are duplicated both horizontally and 
vertically. Again, a modified version of the 
Bresenham line-drawing algorithm can be 
used to determine which samples to duplicate. 

Scaling using pixel dropping or duplication 
is not recommended due to the visual artifacts 
and the introduction of aliasing components. 


Linear Interpolation 


An improvement in video quality of scaled 
images is possible using linear interpolation. 
When an output sample falls between two input 
samples (horizontally or vertically), the output 
sample is computed by linearly interpolating 
between the two input samples. However, scal- 
ing to images smaller than one-half of the origi- 
nal still results in deleted samples. 


Figure 7.27 illustrates the vertical scaling 
of a 16:9 image to fit on a 4:3 display, acommon 
requirement for DVD players. A simple bi-lin- 
ear vertical filter is commonly used, as shown 
in Figure 7.28a. Two source samples, L, and 
Ly41, are weighted and added together to form 
a destination sample, D,». 


Dy = 0.75Lo + 0.251} 
D, = 0.5L, + 0.5L, 
Do = 0.25Le + 0.75L, 


However, as seen in Figure 7.28a, this results 
in uneven line spacing, which may result in 
visual artifacts. Figure 7.28b illustrates vertical 
filtering that results in the output lines being 
more evenly spaced: 


Do =1o 
Dy - 2/3)Ly + 0/3)L, 
Dy = (1/3)Ly + @/3)Ly 


The linear interpolator is a poor band- 
width-limiting filter. Excess high-frequency 
detail is removed unnecessarily and too much 
energy above the Nyquist limit is still present, 
resulting in aliasing. 


Anti-Aliased Resampling 


The most desirable approach is to ensure the 
frequency content scales proportionally with 
the image size, both horizontally and verti- 
cally. 

Figure 7.29 illustrates the fundamentals of 
an anti-aliased resampling process. The input 
data is upsampled by A and lowpass filtered to 
remove image frequencies created by the 
interpolation process. Filter B bandwidth-limits 
the signal to remove frequencies that will alias 
in the resampling process B. The ratio of B/A 
determines the scaling factor. 
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Figure 7.27. Vertical Scaling of 16:9 Images to Fit on a 
4:3 Display. (a) 480-line systems. (b) 576-line systems. 
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Figure 7.28. 75% Vertical Scaling of 16:9 Images to Fit on a 4:3 Display. 
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(a) Unevenly spaced results. (b) Evenly spaced results. 
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Figure 7.29. General Anti-Aliased Resampling Structure. 


DO 


D1 


D2 


D3 


D4 


DS 


219 


Scan Rate Conversion 


Filters A and B are usually combined into a 
single filter. The response of the filter largely 
determines the quality of the interpolation. 
The ideal lowpass filter would have a very flat 
passband, a sharp cutoff at half of the lowest 
sampling frequency (either input or output), 
and very high attenuation in the stopband. 
However, since such a filter generates ringing 
on sharp edges, it is usually desirable to roll off 
the top of the passband. This makes for 
slightly softer pictures, but with less pro- 
nounced ringing. 

Passband ripple and stopband attenuation 
of the filter provide some measure of scaling 
quality, but the subjective effect of ringing 
means a flat passband might not be as good as 
one might think. Lots of stopband attenuation 
is almost always a good thing. 

There are essentially three variations of 
the general resampling structure. Each com- 
bines the elements of Figure 7.29 in various 
ways. 

One approach is a variable-bandwidth anti- 
aliasing filter followed by a combined interpola- 
tor/resampler. In this case, the filter needs 
new coefficients for each scale factor—as the 
scale factor is changed, the quality of the 
image may vary. In addition, the overall 
response is poor if linear interpolation is used. 
However, the filter coefficients are time-invari- 
ant and there are no gain problems. 

A second approach is a combined filter/ 
interpolator followed by a resampler. Gener- 
ally, the higher the order of interpolation, 2, 
the better the overall response. The center of 
the filter transfer function is always aligned 
over the new output sample. With each scaling 
factor, the filter transfer function is stretched 
or compressed to remain aligned over 7 output 
samples. Thus, the filter coefficients, and the 
number of input samples used, change with 
each new output sample and scaling factor. 


Dynamic gain normalization is required to 
ensure the sum of the filter coefficients is 
always equal to one. 

A third approach is an interpolator fol- 
lowed by a combined filter/resampler. The 
input data is interpolated up to a common mul- 
tiple of the input and output rates by the inser- 
tion of zero samples. This is filtered with a low- 
pass finite-impulse-response (FIR) filter to 
interpolate samples in the zero-filled gaps, then 
re-sampled at the required locations. This type 
of design is usually achieved with a 
“polyphase” filter, switching its coefficients as 
the relative position of input and output sam- 
ples change. 


Scan Rate Conversion 


In many cases, some form of scan rate conver- 
sion (also called temporal rate conversion, 
frame rate conversion, or field rate conversion) 
is needed. Multi-standard analog VCRs and 
scan converters use scan rate conversion to 
convert between various video standards. 
Computers usually operate the display at about 
75 Hz noninterlaced, yet need to display 50- 
and 60-Hz interlaced video. With digital televi- 
sion, multiple refresh rates are supported. 

Note that processing must be performed 
on component video signals (such as R’G’B’ or 
YCbCr). Composite color video signals cannot 
be processed directly due to the color subcar- 
rier phase information present, which would 
be meaningless after processing. 


Frame or Field Dropping and 
Duplicating 
Simple scan-rate conversion may be done by 


dropping or duplicating one out of every N 
fields. For example, the conversion of 60-Hz to 
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50-Hz interlaced operation may drop one out of 
every six fields, as shown in Figure 7.30, using 
a single field store. 

The disadvantage of this technique is that 
the viewer may be see jerky motion, or motion 
“judder.” 

The worst artifacts are present when a 
non-integer scan rate conversion is done—for 
example, when some frames are displayed 
three times, while others are displayed twice. 
In this instance, the viewer will observe double 
or blurred objects. As the human brain tracks 
an object in successive frames, it expects to 
see a regular sequence of positions, and has 
trouble reconciling the apparent stop-start 
motion of objects. As a result, it incorrectly 
concludes that there are two objects moving in 
parallel. 


Temporal Interpolation 


This technique generates new frames from the 
original frames as needed to generate the 
desired frame rate. Information from both past 
and future input frames should be used to opti- 
mally handle objects appearing and disappear- 


ing. 


60 HZ 
INTERLACED 
(WRITE) 


Conversion of 50-Hz to 60-Hz operation 
using temporal interpolation is illustrated in 
Figure 7.31. For every five fields of 50-Hz 
video, there are six fields of 60-Hz video. 

After both sources are aligned, two adja- 
cent 50-Hz fields are mixed together to gener- 
ate a new 60-Hz field. This technique is used in 
some inexpensive standards converters to con- 
vert between 625/50 and 525/60 standards. 
Note that no motion analysis is done. There- 
fore, if the camera operating at 625/50 pans 
horizontally past a narrow vertical object, you 
see one object once every six 525/60 fields, 
and for the five fields in between, you see two 
objects, one fading in while the other fades out. 


625/50 to 525/60 Examples 
Figure 7.32 illustrates a scan rate converter 
that implements vertical, followed by temporal, 
interpolation. Figure 7.33 illustrates the spec- 
tral representation of the design in Figure 7.32. 
Many designs now combine the vertical 
and temporal interpolation into a single design, 
as shown in Figure 7.34, with the correspond- 
ing spectral representation shown in Figure 
7.35. This example uses vertical, followed by 
temporal, interpolation. If temporal, followed 


FIELD 
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Figure 7.30. 60-Hz to 50-Hz Conversion Using a Single Field Store by 
Dropping One out of Every Six Fields. 
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Figure 7.31. 50-Hz to 60-Hz Conversion Using Temporal 
Interpolation with No Motion Compensation. 


by vertical, interpolation were implemented, 
the field stores would be half the size. How- 
ever, the number of line stores would increase 
from four to eight. 

In either case, the first interpolation pro- 
cess must produce an intermediate, higher- 
resolution progressive format to avoid inter- 
lace components that would interfere with the 
second interpolation process. It is insufficient 
to interpolate, either vertically or temporally, 
using a mixture of lines from both fields, due 
to the interpolation process not being able to 
compensate for the temporal offset of inter- 
laced lines. 


Motion Compensation 

Higher-quality scan rate converters using tem- 
poral interpolation incorporate motion com- 
pensation to minimize motion artifacts. This 
results in extremely smooth and _ natural 
motion, and images appear sharper and do not 
suffer from motion “judder.” 

Motion estimation for scan rate conversion 
differs from that used by MPEG. In MPEG, 
the goal is to minimize the displaced frame dif- 
ference (error) by searching for a high corre- 
lation between areas in subsequent frames. 
The resulting motion vectors do not necessar- 
ily correspond to true motion vectors. 


For scan rate conversion, it is important to 
determine true motion information to perform 
correct temporal interpolation. The interpola- 
tion should be tolerant of incorrect motion vec- 
tors to avoid introducing artifacts as 
unpleasant as those the technique is attempt- 
ing to remove. Motion vectors could be incor- 
rect for several reasons, such as insufficient 
time to track the motion, out-of-range motion 
vectors, and estimation difficulties due to alias- 


ing. 


100 Hz Interlaced Television Example 

A standard PAL television shows 50 fields per 
second. The images flicker, especially when 
you look at large areas of highly-saturated 
color. A much improved picture can be 
achieved using a 100 Hz interlaced refresh 
(also called double scan). Of course, this tech- 
nique also applies to generating 120 Hz inter- 
laced televisions for the NTSC markets. 

Early 100 Hz televisions simply repeated 
fields (FY FY FoFoF 3F3F4F4...), as shown in Fig- 
ure 7.36a. However, they still had line flicker, 
where horizontal lines constantly jumped 
between the odd and even lines. This distur- 
bance occurred once every twenty-fifth of a 
second. 
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Figure 7.32. Typical 625/50 to 525/60 Conversion Using Vertical, Followed by Temporal, 
Interpolation. 
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Figure 7.33. Spectral Representation of Vertical, Followed by Temporal, Interpolation. (a) Vertical 
lowpass filtering. (b) Resampling to intermediate sequential format and temporal lowpass filtering. 
(c) Resampling to final standard. 
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Figure 7.34. Typical 625/50 to 525/60 Conversion Using Combined 
Vertical and Temporal Interpolation. 
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Figure 7.35. Spectral Representation of Combined Vertical and Temporal Interpolation. 
(a) Two-dimensional lowpass filtering. (b) Resampling to final standard. 
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Figure 7.36. 50 Hz to 100 Hz (Double Scan Interlaced) Techniques. 
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Scan Rate Conversion 


The field sequence Fy FoF FoF 3F4F3F 4... 
can be used, which solves the line flicker prob- 
lem. Unfortunately, this gives rise to the prob- 
lem of judder in moving images. This can be 
compensated for by using the 
Fy FoF FoF 3F4F3F«... sequence for static 
images, and the FF, FoFoF3F3F4F4... sequence 
for moving images. 

An ideal picture is still not obtained when 
viewing programs created for film. They are 
subject to judder, owing to the fact that each 
film frame is transmitted twice. Instead of the 
field sequence F FYFoFoF 3F3F 4Fy..., the 
situation calls for the sequence 
FF) FoFo F3F 3 F4Fy... (Figure 7.36b), where 
F, is a motion-compensated generated 
image between F, and F,,,1. 


3-2 Pulldown 


For completeness, the conversion of film to 
video is also covered. Film is usually recorded 
at 24 frames per second. 

When converting to PAL or SECAM (50- 
Hz field rate), each film frame is usually 
mapped into 2 video fields (2-2 pulldown), 
resulting in the video program being 4% too 
fast. The best transfers repeat every 12th film 
frame for an extra video field to remove the 4% 
error. 

When converting film to NTSC (59.94-Hz 
field rate), 3-2 pulldown is used, as shown in 
Figure 7.37. The film speed is slowed down by 
0.1% to 23.976 (24/1.001) frames per second. 
Two film frames generate five video fields. In 
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Figure 7.37. Typical 3-2 Pulldown for Transferring Film to NTSC Video. 
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scenes of high-speed motion of objects, the 
specific film frame used for a particular video 
field may be manually adjusted to minimize 
motion artifacts. 

3-2 pulldown may also be used by MPEG 2 
decoders to simply increase the frame rate 
from 23.976 (24/1.001) to 59.94 (60/1.001) 
frames per second, avoiding the deinterlacing 
issue. 

Analog laserdiscs use a white flag signal to 
indicate the start of another sequence of 
related fields for optimum stillframe perfor- 
mance. During still-frame mode, the white flag 
signal tells the system to back up two fields (to 
use two fields that have no motion between 
them) to re-display the current frame. 

Varispeed is commonly used to cover up 
problems such as defects, splicing, censorship 
cuts, or to change the running time of a pro- 
gram. Rather than repeating film frames and 
causing a “stutter,” the 3-2 relationship 
between the film and video is disrupted long 
enough to ensure a smooth temporal rate. 


Noninterlaced-to-Interlaced 
Conversion 


In some applications, it is necessary to display 
a noninterlaced video signal on an interlaced 
display. Thus, some form of “noninterlaced-to- 
interlaced conversion” may be required. 

Noninterlaced to interlaced conversion 
must be performed on component video sig- 
nals (such as R’G’B’ or YCbCr). Composite 
color video signals (such as NTSC or PAL) 
cannot be processed directly due to the pres- 
ence of color subcarrier phase information, 
which would be meaningless after processing. 
These signals must be decoded into compo- 
nent color signals, such as R’G’B’ or YCbCr, 
prior to conversion. 


There are essentially two techniques: scan 
line decimation and vertical filtering. 


Scan Line Decimation 


The easiest approach is to throw away every 
other active scan line in each noninterlaced 
frame, as shown in Figure 7.38. Although the 
cost is minimal, there are problems with this 
approach, especially with the top and bottom of 
objects. 

If there is a sharp vertical transition of 
color or intensity, it will flicker at one-half the 
refresh rate. The reason is that it is only dis- 
played every other field as a result of the deci- 
mation. For example, a horizontal line that is 
one noninterlaced scan line wide will flicker on 
and off. Horizontal lines that are two noninter- 
laced scan lines wide will oscillate up and 
down. 

Simple decimation may also add aliasing 
artifacts. While not necessarily visible, they 
will affect any future processing of the picture. 


Vertical Filtering 


A better solution is to use two or more lines of 
noninterlaced data to generate one line of 
interlaced data. Fast vertical transitions are 
smoothed out over several interlaced lines. 

For a 3-line filter, such as shown in Figure 
7.39, typical coefficients are [0.25, 0.5, 0.25]. 
Using more than 3 lines usually results in 
excessive blurring, making small text difficult 
to read. 

An alternate implementation uses IIR 
rather than FIR filtering. In addition to averag- 
ing, this technique produces a reduction in 
brightness around objects, further reducing 
flicker. 
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Figure 7.38. Noninterlaced-to-Interlaced Conversion Using Scan Line Decimation. 
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Figure 7.39. Noninterlaced-to-Interlaced Conversion Using 3-Line Vertical Filtering. 
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Note that care must be taken at the begin- 
ning and end of each frame in the event that 
fewer scan lines are available for filtering. 


Interlaced-to-Noninterlaced 
Conversion 


In some applications, it is necessary to display 
an interlaced video signal on a noninterlaced 
display. Thus, some form of “deinterlacing” or 
“progressive scan conversion” may be 
required. 

Note that deinterlacing must be performed 
on component video signals (such as R’G’'B’ or 
YCbCr). Composite color video signals (such 
as NTSC or PAL) cannot be deinterlaced 
directly due to the presence of color subcarrier 
phase information, which would be meaning- 
less after processing. These signals must be 
decoded into component color signals, such as 
R’G’B’ or YCbCr, prior to deinterlacing. 


Intrafield Processing 


This is the simplest method, generating addi- 
tional scan lines between the original scan 
lines using only information in the original 
field. The computer industry has coined this as 
“bob.” 

The resulting vertical resolution is always 
limited by the content of the original field. 


Scan Line Duplication 

Scan line duplication (Figure 7.40) simply 
duplicates the previous active scan line. 
Although the number of active scan lines is 
doubled, there is no increase in the vertical 
resolution. 


Scan Line Interpolation 

Scan line interpolation generates interpolated 
scan lines between the original active scan 
lines. Although the number of active scan lines 
is doubled, the vertical resolution is not. 

The simplest implementation, shown in 
Figure 7.41, uses linear interpolation to gener- 
ate a new scan line between two input scan 
lines: 


out, = (iny_y + iNyiy) / 2 


More accurate interpolation, at additional 
cost, may be done by using (sin x) /x interpola- 
tion rather than linear interpolation: 


out, = 0.127in,_5 - 0.21lin,_3 + 0.64in,_j + 
0.64in,,1 — 0.21in, 43 + 0.127ing 5 


Fractional Ratio Interpolation 

In many cases, there is a periodic, but non-inte- 
gral, relationship between the number of input 
scan lines and the number of output scan lines. 
In this case, fractional ratio interpolation may 
be necessary, similar to the polyphase filtering 
used for scaling only performed in the vertical 
direction. This technique combines deinterlac- 
ing and vertical scaling into a single process. 


Variable Interpolation 

In a few cases, there is no periodicity in the 
relationship between the number of input and 
output scan lines. Therefore, in theory, an infi- 
nite number of filter phases and coefficients 
are required. Since this is not feasible, the 
solution is to use a large, but finite, number of 
filter phases. The number of filter phases 
determines the interpolation accuracy. This 
technique also combines deinterlacing and ver- 
tical scaling into a single process. 
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INPUT FIELD OUTPUT FRAME INPUT FIELD OUTPUT FRAME 
ACTIVE LINES ACTIVE LINES ACTIVE LINES ACTIVE LINES 
1 1 1 _ 1 
2=1 2 = (1+3)/2 
2 3 2 _ 3 
4=3 4 = (3+5)/2 
3 5 3 Seemed 5 
6=5 6 = (5+7)/2 
4 7 4 = 7 
8=7 8 = (7+9)/2 
Figure 7.40. Deinterlacing Using Scan Line Figure 7.41. Deinterlacing Using Scan Line 
Duplication. New scan lines are generated by Interpolation. New scan lines are generated 
duplicating the active scan line above it. by averaging the previous and next active 
scan lines. 
FIELD 1 FIELD 2 DEINTERLACED 
ACTIVE ACTIVE FRAME 
LINE LINE ACTIVE LINE 
1 _ 1 
1 ——_. 2 
2 = 3 2 4 6 8 ———_ 
7 —- a 123 4567 8 9 10 
3 = : 1 3 5 7 9 —_ 
3 — 6 
4 = $7 INPUT OUTPUT 
4 8 FIELD FRAME 
NUMBER NUMBER 


Figure 7.42. Deinterlacing Using Field 


Figure 7.43. Producing Deinterlaced 


Merging. Shaded scan lines are generated by Frames at Field Rates. 
using the input scan line from the next or 
previous field. 
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OBJECT POSITION 
IN FIELD ONE 


OBJECT POSITION 
IN FIELD TWO 


OBJECT POSITIONS 
IN MERGED FIELDS 


Figure 7.44. Movement Artifacts When Field 


Merging Is Used. 


Interfield Processing 


In this method, video information from more 
than one field is used to generate a single pro- 
gressive frame. This method can provide 
higher vertical resolution since it uses content 
from more than a single field. The computer 
industry refers to this as “weave,” but “weave” 
also includes the inverse telecine process. 


Field Merging 
This technique merges two consecutive fields 
together to produce a frame of video (Figure 
7.42). At each field time, the active scan lines of 
that field are merged with the active scan lines 
of the previous field. The result is that for each 
input field time, a pair of fields combine to gen- 
erate a frame (see Figure 7.43). 

Although simple to implement conceptu- 
ally, and the vertical resolution is doubled, 


there are artifacts in regions of movement. 
This is due to the time difference between two 
fields—a moving object may be located in a dif- 
ferent position from one field to the next. 
When the two fields are merged, there is a 
“double image” of the moving object (see Fig- 
ure 7.44). 


Motion Adaptive Deinterlacing 
A better solution is to use field merging for still 
areas of the picture and scan line interpolation 
for areas of movement. To accomplish this, 
motion, on a sample-by-sample basis, must be 
detected over the entire picture in real time. 
As two fields are combined, full vertical 
resolution is maintained in still areas of the pic- 
ture, where the eye is most sensitive to detail. 
The sample differences may have any value, 
from 0 (no movement and noise-free) to maxi- 
mum (for example, a change from full intensity 
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to black). A choice must be made when to use 
a sample from the previous field (which is in 
the wrong location due to motion) or to inter- 
polate a new sample from adjacent scan lines in 
the current field. Sudden switching between 
methods is visible, so crossfading (also called 
soft switching) is used. At some magnitude of 
sample difference, the loss of resolution due to 
a double image is equal to the loss of resolu- 
tion due to interpolation. That amount of 
motion should result in the crossfader being at 
the 50% point. Less motion will result in a fade 
towards field merging and more motion in a 
fade towards the interpolated values. 


Motion Compensated Deinterlacing 
Motion compensated deinterlacing is several 
orders of magnitude more complex than 
motion adaptive deinterlacing, and may be 
found in pro-video format converters. 

Motion compensated processing requires 
calculating motion vectors between fields for 
each sample, and interpolating along each 
sample’s motion trajectory. Note that motion 
adaptive processing simply requires detecting 
motion at the sample level, not finding sample 
motion vectors. 

Motion vectors must be found that pass 
through each of the missing samples. Areas of 
the picture may be covered or uncovered as 
you move between frames. The motion vectors 
must have sub-pixel accuracy, and be deter- 
mined in two temporal directions between 
frames. 

For MPEG, motion vector errors are self- 
correcting since the residual difference 
between the predicted macroblocks is 
encoded. As motion compensated deinterlac- 
ing is a single-ended system, motion vector 
errors will produce artifacts, so different 
search and verification algorithms must be 
used. 


Inverse Telecine 
For video signals that use 3-2 pulldown, higher 
interfield deinterlacing performance may be 
obtained by removing duplicate fields prior to 
processing. 

The inverse telecine process detects the 3- 
2 field sequence and the redundant 3rd fields 
are removed. The remaining field pairs are 
merged (since there is no motion between 
them) to form progressive frames, and then 
repeated in a 3-2 progressive frame sequence. 

In the cases where the source is from an 
MPEG decoder, the redundant fields are not 
included in the MPEG video stream. Thus, the 
inverse telecine process may be done by sim- 
ply not implementing the MPEG 2 repeat field 
processing. 


Frequency Response Considerations 


Various two-times vertical upsampling tech- 
niques for deinterlacing may be implemented 
by stuffing zero values between two valid lines 
and filtering, as shown in Figure 7.45. 

Line A shows the frequency response for 
line duplication, in which the lowpass filter 
coefficients for the filter shown are 1, 1, and 0. 

Line interpolation, using lowpass filter 
coefficients of 0.5, 1.0, and 0.5, results in the 
frequency response curve of Line B. Note that 
line duplication results in a better high-fre- 
quency response. Vertical filters with a better 
frequency response than the one for line dupli- 
cation are possible, at the cost of more line 
stores and processing. 

The best vertical frequency response is 
obtained when field merging is implemented. 
The spatial position of the lines is already cor- 
rect and no vertical processing is required, 
resulting in a flat curve (Line C). Again, this 
applies only for stationary areas of the image. 
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DCT-Based Compression 


The transform process of many video compres- 
sion standards is based on the Discrete Cosine 
Transform, or DCT. The easiest way to envi- 
sion it is as a filter bank with all the filters com- 
puted in parallel. 

During encoding, the DCT is usually fol- 
lowed by several other operations, such as 
quantization, zig-zag scanning, run-length 
encoding, and variable-length encoding. Dur- 
ing decoding, this process flow is reversed. 


GAIN 
4 


525 


VERTICAL FREQUENCY 
(CYCLES PER PICTURE HEIGHT) 


TE Se 


Figure 7.45. Frequency Response of Various 


Deinterlacing Filters. (a) Line duplication. (b) Line 


interpolation. (c) Field merging. 


Many times, the terms macroblocks and 
blocks are used when discussing video com- 
pression. Figure 7.46 illustrates the relation- 
ship between these two terms, and shows why 
transform processing is usually done on 8 x 8 
samples. 


DCT 


The 8 x 8 DCT processes an 8 x 8 block of sam- 
ples to generate an 8 x 8 block of DCT coeffi- 
cients, as shown in Figure 7.47. The input may 
be samples from an actual frame of video or 
motion-compensated difference (error) values, 
depending on the encoder mode of operation. 
Each DCT coefficient indicates the amount of a 
particular horizontal or vertical frequency 
within the block. 

DCT coefficient (0,0) is the DC coefficient, 
or average sample value. Since natural images 
tend to vary only slightly from sample to sam- 
ple, low frequency coefficients are typically 
larger values and high frequency coefficients 
are typically smaller values. 

The 8 x 8 DCT is defined in Figure 7.48. 
f(x,y) denotes sample (x, y) of the 8 x 8 input 
block and F(u,v) denotes coefficient (u, v) of 
the DCT transformed block. 

The original 8 x 8 block of samples can be 
recovered using an 8 x 8 inverse DCT (IDCT), 
defined in Figure 7.49. Although exact recon- 
struction is theoretically achievable, it is usu- 
ally not possible due to using finite-precision 
arithmetic. While forward DCT errors can usu- 
ally be tolerated, inverse DCT errors must 
meet the compliance specified in the relevant 
standard. 
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DIVIDE PICTURE 
INTO 16 X 16 BLOCKS 


ear Beer" bvassit 


EACH MACROBLOCK IS 
16 SAMPLES BY 16 LINES 
(4 BLOCKS) 


EACH BLOCK IS 8 
SAMPLES BY 8 LINES 


Figure 7.46. The Relationship between Macroblocks and Blocks. 
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Figure 7.47. The DCT Processes the 8 x 8 Block of Samples or Error Terms to Generate 
an 8 x 8 Block of DCT Coefficients. 
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F(u,v) = 0.25C(u)C(v) ¥ YY. fle.y )cos(((2x + 1)um)/16)cos(((2y + 1)vm)/ 16) 


x=O0y=0 


u,v, x,y=0,1,2,...7 


(x, y) are spatial coordinates in the sample domain 


(u, v) are coordinates in the transform domain 


Figure 7.48. 8 x 8 Two-Dimensional DCT Definition. 
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fee.y) = 0.25C(u)C(v) YY, Flu, v)cos(((2x + 1L)ut)/16) cos(((2y + 1)vt)/ 16) 


x=O0y=0 


Figure 7.49. 8 x 8 Two-Dimensional Inverse DCT (IDCT) Definition. 


Quantization 


The 8 x 8 block of DCT coefficients is quan- 
tized, limiting the number of allowed values for 
each coefficient. This is the first lossly com- 
pression step. Higher frequencies are usually 
quantized more coarsely (fewer values 
allowed) than lower frequencies, due to visual 
perception of quantization error. This results 
in many DCT coefficients being zero, espe- 
cially at the higher frequencies. 


Zig-Zag Scanning 


The quantized DCT coefficients are re- 
arranged into a linear stream by scanning 
them in a zig-zag order. This rearrangement 
places the DC coefficient first, followed by fre- 
quency coefficients arranged in order of 
increasing frequency, as shown in Figures 
7.50, 7.51, and 7.52. This produces long runs of 
zero coefficients. 


Run Length Coding 


The linear stream of quantized frequency coef 
ficients is converted into a series of [run, 
amplitude] pairs. [run] indicates the number 
of zero coefficients, and [amplitude] the non- 
zero coefficient that ended the run. 


Variable-Length Coding 


The [run, amplitude] pairs are coded using a 
variable-length code, resulting in additional 
lossless compression. This produces shorter 
codes for common pairs and longer codes of 
less common pairs. 

This coding method produces a more com- 
pact representation of the DCT coefficients, as 
a large number of DCT coefficients are usually 
quantized to zero and the re-ordering results 
(ideally) in the grouping of long runs of con- 
secutive zero values. 
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0 1 5 6 | 14 | 15 | 27 | 28 
2 4 7 | 13 | 16 | 26 | 29 | 42 mn 
3 8 | 12 | 17 | 25 | 30 | 41 | 43 
9 | 11 | 18 | 24 | 31 | 40 | 44 | 53 

= F 
jo | 19 | 23 | 32 | 39 | 45 | 52 | 54 
20 | 22 | 33 | 38 | 46 | 51 | 55 | 60 
21 | 34 | 37 | 47 | 50 | 56 | 59 | 61 LINEAR ARRAY 

OF 64 FREQUENCY 

35 | 36 | 48 | 49 | 57 | 58 | 62 | 63 COEFFICIENTS 


ZIG-ZAG SCAN OF 
8 X 8 BLOCK OF 
QUANTIZED 
FREQUENCY 
COEFFICIENTS 


Figure 7.50. The 8 x 8 Block of Quantized DCT Coefficients Are Zig-Zag Scanned to 
Arrange in Order of Increasing Frequency. This scanning order is used for H.261, H.263, 
MPEG 1, and progressive pictures in MPEG 2. 


0 4 6 | 20 | 22 | 36 | 38 | 52 
1 5 7 | 21 | 23 | 37 | 39 | 53 n 
2 8 | 19 | 24 | 34 | 40 | 50 | 54 
3 9 | 18 | 25 | 35 | 41 | 51 | 55 

= F 
10 | 17 | 26 | 30 | 42 | 46 | 56 | 60 
11 | 16 | 27 | 31 | 43 | 47 | 57 | 61 
12 | 15 | 28 | 32 | 44 | 48 | 58 | 62 LINEAR ARRAY 

OF 64 FREQUENCY 

13 | 14 | 29 | 33 | 45 | 49 | 59 | 63 COEFFICIENTS 


ZIG-ZAG SCAN OF 
8 X 8 BLOCK OF 
QUANTIZED 
FREQUENCY 
COEFFICIENTS 


Figure 7.51. MPEG 2 and H.263 Alternate Zig-Zag Scanning Order. 
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0 1 2 3 10 11 12 13 
4 5 8 9 17 16 15 14 
6 7 19 18 26 27 28 29 


52 53 54 55 60 61 62 63 


ZIG-ZAG SCAN OF 
8 X 8 BLOCK OF 
QUANTIZED 
FREQUENCY 
COEFFICIENTS 


LINEAR ARRAY 
OF 64 FREQUENCY 
COEFFICIENTS 


Figure 7.52. H.263 Alternate Zig-Zag Scanning Order. 
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NTSC, PAL, and 
SECAM Overview 


To fully understand the NTSC, PAL, and 
SECAM encoding and decoding processes, it 
is helpful to review the background of these 
standards and how they came about. 


NTSC Overview 


The first color television system was developed 
in the United States, and on December 17, 
1953, the Federal Communications Commis- 
sion (FCC) approved the transmission stan- 
dard, with broadcasting approved to begin 
January 23, 1954. Most of the work for develop- 
ing a color transmission standard that was 
compatible with the (then current) 525-line, 60- 
field-per-second, 2:1 interlaced monochrome 
standard was done by the National Television 
System Committee (NTSC). 


Luminance Information 


The monochrome luminance (Y) signal is 
derived from gamma-corrected red, green, and 
blue (R’G’B) signals: 


Y = 0.299R’ + 0.587G’ + 0.114B’ 


Due to the sound subcarrier at 4.5 MHz, a 
requirement was made that the color signal fit 
within the same bandwidth as the mono- 
chrome video signal (0-4.2 MHz). 

For economic reasons, another require- 
ment was made that monochrome receivers 
must be able to display the black and white 
portion of a color broadcast and that color 
receivers must be able to display a mono- 
chrome broadcast. 


Color Information 


The eye is most sensitive to spatial and tempo- 
ral variations in luminance; therefore, lumi- 
nance information was still allowed the entire 
bandwidth available (0-4.2 MHz). Color infor- 
mation, to which the eye is less sensitive and 
which therefore requires less bandwidth, is 
represented as hue and saturation information. 

The hue and saturation information is 
transmitted using a 3.58-MHz_ subcarrier, 
encoded so that the receiver can separate the 
hue, saturation, and luminance information 
and convert them back to RGB signals for dis- 
play. Although this allows the transmission of 
color signals within the same bandwidth as 
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monochrome signals, the problem - still 
remains as to how to cost-effectively separate 
the color and luminance information, since 
they occupy the same portion of the frequency 
spectrum. 

To transmit color information, U and V or I 
and Q “color difference” signals are used: 


R’ - Y = 0.701R’ - 0.587G’ - 0.114B’ 
B’- Y = -0.299R’ — 0.587G’ + 0.886B’ 


U = 0.492(B’ - Y) 
V = 0.877(R’ - Y) 


I = 0.596R’ — 0.275G’ — 0.321B’ 
= Vcos 33° — Usin 33° 
= 0.736(R’ — Y) — 0.268(B’ — Y) 


Q = 0.212R’ — 0.523G’ + 0.311B’ 
= Vsin 33° + Ucos 33° 
= 0.478(R’ — Y) + 0.413 (B’ - Y) 


The scaling factors to generate U and V 
from (B’ — Y) and (R’ — Y) were derived due to 
overmodulation considerations during trans- 
mission. If the full range of (B’ — Y) and (R’ - 
Y) were used, the modulated chrominance lev- 
els would exceed what the monochrome trans- 
mitters were capable of supporting. 
Experimentation determined that modulated 
subcarrier amplitudes of 20% of the Y signal 
amplitude could be permitted above white and 
below black. The scaling factors were then 
selected so that the maximum level of 75% 
color would be at the white level. 

I and Q were initially selected since they 
more closely related to the variation of color 
acuity than U and V. The color response of the 
eye decreases as the size of viewed objects 
decreases. Small objects, occupying frequen- 
cies of 1.3-2.0 MHz, provide little color sensa- 
tion. Medium objects, occupying the 0.6-1.3 


MHz frequency range, are acceptable if repro- 
duced along the orange-cyan axis. Larger 
objects, occupying the 0-0.6 MHz frequency 
range, require full three-color reproduction. 

The I and Q bandwidths were chosen 
accordingly, and the preferred color reproduc- 
tion axis was obtained by rotating the U and V 
axes by 33°. The Q component, representing 
the green-purple color axis, was band-limited 
to about 0.6 MHz. The I component, represent- 
ing the orange-cyan color axis, was band-lim- 
ited to about 1.3 MHz. 

Another advantage of limiting the I and Q 
bandwidths to 1.3 MHz and 0.6 MHz, respec- 
tively, is to minimize crosstalk due to asymmet- 
rical sidebands as a result of lowpass filtering 
the composite video signal to about 4.2 MHz. 
Q is a double sideband signal; however, I is 
asymmetrical, bringing up the possibility of 
crosstalk between I and Q. The symmetry of Q 
avoids crosstalk into I; since Q is bandwidth 
limited to 0.6 MHz, I crosstalk falls outside the 
Q bandwidth. 

Advances in electronics have prompted 
changes. U and V, both bandwidth-limited to 
1.3 MHz, are now commonly used instead of I 
and Q. A greater amount of processing is 
required in the decoder due to both U and V 
being asymmetrical about the color subcarrier. 

The UV and IQ vector diagram is shown in 
Figure 8.1. 


Color Modulation 


I and Q (or U and V) are used to modulate a 
3.58-MHz color subcarrier using two balanced 
modulators operating in phase quadrature: one 
modulator is driven by the subcarrier at sine 
phase, the other modulator is driven by the 
subcarrier at cosine phase. The outputs of the 
modulators are added together to form the 
modulated chrominance signal: 
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C =Q sin (at + 33°) + I cos (at + 33°) 
= 20F sc 
Fc = 3.579545 MHz (+ 10 Hz) 


or, if U and V are used instead of I and Q: 


C=U sin wt+ Vcos at 


Hue information is conveyed by the 
chrominance phase relative to the subcarrier. 
Saturation information is conveyed by chromi- 
nance amplitude. In addition, if an object has 
no color (such as a white, gray, or black 
object), the subcarrier is suppressed. 


167° 


Composite Video Generation 


The modulated chrominance is added to the 
luminance information along with appropriate 
horizontal and vertical sync signals, blanking 
information, and color burst information, to 
generate the composite color video waveform 
shown in Figure 8.2. 


composite NTSC = Y + Q sin (at + 33°) 
+I cos (mt + 33°) + timing 


or, if U and V are used instead of I and Q: 


composite NTSC = Y + U sin at 
+V cos ot + timing 


IRE SCALE UNITS 


MAGENTA 
61° 


Figure 8.1. UV and IQ Vector Diagram for 75% Color Bars. 
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Figure 8.2. (M) NTSC Composite Video Signal for 75% Color Bars. 
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The bandwidth of the resulting composite 
video signal is shown in Figure 8.3. 

The I and Q (or U and V) information can 
be transmitted without loss of identity as long 
as the proper color subcarrier phase relation- 
ship is maintained at the encoding and decod- 
ing process. A color burst signal, consisting of 
nine cycles of the subcarrier frequency at a 
specific phase, follows most horizontal sync 
pulses, and provides the decoder a reference 
signal so as to be able to properly recover the I 
and Q (or U and V) signals. The color burst 
phase is defined to be along the —U axis as 
shown in Figure 8.1. 


Color Subcarrier Frequency 


The specific choice for the color subcarrier fre- 
quency was dictated by several factors. The 
first was the need to provide horizontal inter- 
lace to reduce the visibility of the subcarrier, 
requiring that the subcarrier frequency, Fxc, 
be an odd multiple of one-half the horizontal 
line rate. The second factor was selection of a 
frequency high enough that it generated a fine 
interference pattern having low visibility. 
Third, double sidebands for I and Q (or U and 
V) bandwidths below 0.6 MHz had to be 
allowed. 
The choice of the frequencies is: 


Fy = (4.5 x 10°/286) Hz = 15,734.27 Hz 
Fy = Fy/(525/2) = 59.94 Hz 


Foc = ((13 x7x 5)/2) x Fy = (455/2) x Fy 
= 3.579545 MHz 


The resulting Fy (field) and Fy (line) rates 
were slightly different from the monochrome 
standards, but fell well within the tolerance 
ranges and were therefore acceptable. Figure 
8.4 illustrates the resulting spectral interleav- 


ing. 


The luminance (Y) components are modu- 
lated due to the horizontal blanking process, 
resulting in bunches of luminance information 
spaced at intervals of Fy. These signals are fur- 
ther modulated by the vertical blanking pro- 
cess, resulting in luminance frequency 
components occurring at NFy + MFy. N has a 
maximum value of about 277 with a 4.2-MHz 
bandwidth-limited luminance. Thus, luminance 
information is limited to areas about integral 
harmonics of the line frequency (Fy), with 
additional spectral lines offset from NFy by the 
29.97-Hz vertical frame rate. 

The area in the spectrum between lumi- 
nance groups, occurring at odd multiples of 
one-half the line frequency, contains minimal 
spectral energy and is therefore used for the 
transmission of chrominance information. The 
harmonics of the color subcarrier are sepa- 
rated from each other by Fy since they are odd 
multiples of one-half Fy, providing a half-line 
offset and resulting in an interlace pattern that 
moves upward. Four complete fields are 
required to repeat a specific sample position, 
as shown in Figure 8.5. 


NTSC Variations 


There are three common variations of NTSC, 
as shown in Figures 8.6 and 8.7. 

The first, called “NTSC 4.43,” is commonly 
used for multistandard analog VCRs. The hori- 
zontal and vertical timing is the same as (M) 
NTSC; color encoding uses the PAL modula- 
tion format and a 4.43361875 MHz color sub- 
carrier frequency. 

The second, “NTSC-J,” is used in Japan. It 
is the same as (M) NTSC, except there is no 
blanking pedestal during active video. Thus, 
active video has a nominal amplitude of 714 
mV. 
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CHROMINANCE 


AMPLITUDE SUBCARRIER 
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a}|a 
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Figure 8.3. Video Bandwidths of Baseband (M) NTSC Video. (a) 
Using 1.3-MHz | and 0.6-MHz Q signals. (b) Using 1.3-MHz U and 
V signals. 
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Figure 8.4. Luma and Chroma Frequency Interleave Principle. 
Note that 227.5Fy = Fsc. 
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Figure 8.5. Four-field (M) NTSC Sequence and Burst Blanking. 
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PHASE = HUE 
AMPLITUDE = SATURATION 


“M" 


LINE / FIELD = 525/ 59.94 
FH = 15.734 KHZ 
FV = 59.94 HZ 
FSC = 3.579545 MHZ 


BLANKING SETUP = 7.5 IRE 
VIDEO BANDWIDTH = 4.2 MHZ 
AUDIO CARRIER = 4.5 MHZ 
CHANNEL BANDWIDTH = 6 MHZ 


"NTSC-J" 


LINE/ FIELD = 525/ 59.94 
FH = 15.734 KHZ 
FV = 59.94 HZ 
FSC = 3.579545 MHZ 


BLANKING SETUP = 0 IRE 
VIDEO BANDWIDTH = 4.2 MHZ 
AUDIO CARRIER = 4.5 MHZ 
CHANNEL BANDWIDTH = 6 MHZ 


"NTSC 4.43" 


LINE/ FIELD = 525/ 59.94 
FH = 15.734 KHZ 
FV = 59.94 HZ 
FSC = 4.43361875 MHZ 


BLANKING SETUP = 7.5 IRE 
VIDEO BANDWIDTH = 4.2 MHZ 
AUDIO CARRIER = 4.5 MHZ 
CHANNEL BANDWIDTH = 6 MHZ 


Figure 8.6. Common NTSC Systems. 


The third, called “noninterlaced NTSC,” is 
a 262-line, 60 frames-per-second version of 
NTSC, as shown in Figure 8.7. This format is 
identical to standard (M) NTSC, except that 
there are 262 lines per frame. 


RF Modulation 


Figures 8.8, 8.9, and 8.10 illustrate the basic 
process of converting baseband (M) NTSC 
composite video to a RF (radio frequency) sig- 
nal. 

Figure 8.8a shows the frequency spectrum 
of a baseband composite video signal. It is sim- 
ilar to Figure 8.3. However, Figure 8.3 only 
shows the upper sideband for simplicity. The 
“video carrier” notation at 0 MHz serves only 
as a reference point for comparison with Fig- 
ure 8.8b. 

Figure 8.8b shows the audio/video signal 
as it resides within a 6-MHz channel (such as 
channel 3). The video signal has been lowpass 


filtered, most of the lower sideband has been 
removed, and audio information has been 
added. 

Figure 8.8c details the information present 
on the audio subcarrier for stereo (BTSC) 
operation. 

As shown in Figures 8.9 and 8.10, back 
porch clamping (see glossary) of the analog 
video signal ensures that the back porch level 
is constant, regardless of changes in the aver- 
age picture level. White clipping of the video 
signal prevents the modulated signal from 
going below 10%; below 10% may result in over- 
modulation and “buzzing” in television receiv- 
ers. The video signal is then lowpass filtered to 
4.2 MHz and drives the AM (amplitude modu- 
lation) video modulator. The sync level corre- 
sponds to 100% modulation, the blanking 
corresponds to 75%, and the white level corre- 
sponds to 10%. (M) NTSC systems use an IF 
(intermediate frequency) for the video of 45.75 
MHz. 
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BURST BEGINS WITH POSITIVE HALF-CYCLE 
BURST PHASE = REFERENCE PHASE = 180° RELATIVE TO U 


BURST BEGINS WITH NEGATIVE HALF-CYCLE 
| BURST PHASE = REFERENCE PHASE = 180° RELATIVE TO U 


Figure 8.7. Noninterlaced NTSC Frame Sequence. 


At this point, audio information is added on 
a subcarrier at 41.25 MHz. A monaural audio 
signal is processed as shown in Figure 8.9 and 
drives the FM (frequency modulation) modula- 
tor. The output of the FM modulator is added 
to the IF video signal. 

The SAW filter, used as a vestigial side- 
band filter, provides filtering of the IF signal. 
The mixer, or up converter, mixes the IF signal 
with the desired broadcast frequency. Both 
sum and difference frequencies are generated 
by the mixing process, so the difference signal 
is extracted by using a bandpass filter. 


Stereo Audio (Analog) 


BTSC 

The implementation of stereo audio, known as 
the BTSC system (Broadcast Television Sys- 
tems Committee), is shown in Figure 8.10. 
Countries that use this system include the 
United States, Canada, Mexico, Brazil, and Tai- 
wan. 


To enable stereo, L-R information is trans- 
mitted using a suppressed AM subcarrier. A 
SAP (secondary audio program) channel may 
also be present, commonly used to transmit a 
second language or video description. A pro- 
fessional channel may also be present, allow- 
ing communication with remote equipment 
and people. 


Zweiton M 

This implementation of analog stereo audio 
(ITU-R BS.707), also known as A2 M, is similar 
to that used with PAL. The L+R information is 
transmitted on a FM subcarrier at 4.5 MHz. 
The L-R information, or a second L+R audio 
signal, is transmitted on a second FM subcar- 
rier at 4.724212 MHz. 

If stereo or dual mono signals are present, 
the FM subcarrier at 4.724212 MHz is ampli- 
tude-modulated with a 55.0699 kHz subcarrier. 
This 55.0699 kHz subcarrier is 50% amplitude- 
modulated at 149.9 Hz to indicate stereo audio 
or 276.0 Hz to indicate dual mono audio. 

This system is used in South Korea. 
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VIDEO 
CARRIER 
CHROMINANCE CHROMINANCE 
SUBCARRIER SUBCARRIER 
| | | = FREQUENCY (MHZ) 
-45 -42 -3.58 -3.0 -1.0 0.0 1.0 30 358 42 45 
(A) 
VIDEO 
CARRIER 
CHROMINANCE 
0.75 MHZ SUBCARRIER 
VESTIGIAL 
SIDEBAND AUDIO 
CARRIER 
TI | = FREQUENCY (MHZ) 
-4.0 -3.0 -0.75 0.0 1.0 30 358 42 45 | 5.0 
6 MHZ CHANNEL 
Le | 
1.25 4.75 
(B) 
AUDIO 
CARRIER FH = 15,734 HZ 
4 
STEREO 
PILOT 
PROFESSIONAL 
CHANNEL (FM) 
L+R L-R SAP 
(FM) (AM) (FM) 
| | = FREQUENCY 
0.0 FH 2FH 3FH 4FH 5 FH 6.5 FH 
(C) 


Figure 8.8. Transmission Channel for (M) NTSC. (a) Frequency spectrum of baseband composite 
video. (b) Frequency spectrum of typical channel including audio information. (c) Detailed 
frequency spectrum of BTSC stereo audio information. 
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EIA- 
This implementation for analog stereo audio is 
similar to BTSC, and is used in Japan. The L+R 
information is transmitted on a FM subcarrier 
at 4.5 MHz. The L-R signal, or a second L+R 
signal, is transmitted on a second FM subcar- 
rier at +2Fy. 

If stereo or dual mono signals are present, 
a +3.5Fy subcarrier is amplitude-modulated 
with either a 982.5 Hz subcarrier (stereo 
audio) or a 922.5 Hz subcarrier (dual mono 
audio). 


Analog Channel Assignments 


Tables 8.1 through 8.4 list the typical channel 
assignments for VHF, UHF, and cable for vari- 
ous NTSC systems. 


AUDIO LEFT 


75 uS 


FM 
MODULATOR 


ES 


—— 


PRE-EMPHASIS 


AUDIO RIGHT 


50-15000 HZ BPF 


Note that cable systems routinely reassign 
channel numbers to alternate frequencies to 
minimize interference and provide multiple 
levels of programming (such as regular and 
preview premium movie channels). 


Use by Country 


Figure 8.6 shows the common designations for 
NTSC systems. The letter “M” refers to the 
monochrome standard for line and field rates 
(525/59.94), a video bandwidth of 4.2 MHz, an 
audio carrier frequency 4.5 MHz above the 
video carrier frequency, and a RF channel 
bandwidth of 6 MHz. The “NTSC” refers to the 
technique to add color information to the 
monochrome signal. Detailed timing parame- 
ters can be found in Table 8.9. 


41-47 MHZ 
BANDWIDTH 
BANDPASS 


SAW / MIXER 


MODULATED RF 


BACK 
PORCH 
(M) NTSC CLAMP 

COMPOSITE ——+#| AND See aces 
VIDEO WHITE 
LEVEL 
CLIP 

45.75 MHZ 
5 KHZ IF VIDEO CARRIER 


CLOCK 


41.25 MHZ 
IF AUDIO CARRIER 


VIDEO CARRIER 
OF DESIRED CHANNEL 


AUDIO / VIDEO 


“| FILTER (UP CONVERTER) FILTER 
(6 MHZ BANDWIDTH) 


r 


Figure 8.9. Typical RF Modulation Implementation for (M) NTSC: Mono Audio. 
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PROFESSIONAL 


CHANNEL =——————_—] 


AUDIO 


SECONDARY 
AUDIO 


—— 


150 uS 
PRE-EMPHASIS 


300-3,400 HZ BPF 


FM 


MODULATOR 


PROFESSIONAL CHANNEL 


41.25 MHZ — 6.5FH 
IF AUDIO CARRIER 


50-10,000 HZ BPF 


BTSC 
COMPRESSION 


FM 


MODULATOR 


SECONDARY AUDIO PROGRAM (SAP) 


| 


41.25 MHZ — 5FH 
IF AUDIO CARRIER 


FM STEREO 41.25 MHZ — FH 
PILOT SIGNAL 
L-R 
50-15,000 HZ BPF 
[| 
-————*"] BTSC 
COMPRESSION 
STEREO 
MODULATOR 
AUDIO LEFT = ———__@—_+_»} 75 us 
PRE-EMPHASIS 
AUDIO RIGHT —_—_@——_——_+|__ .......... Sai, 
50-15,000 HZ BPF 
BACK 
PORCH 
courosite ——=| “ano” eg ae ey 
LPF MODULATOR 
VIDEO WHITE 
LEVEL 
CLIP { 


45.75 MHZ 


IF VIDEO CARRIER 


41-47 MHZ 
BANDWIDTH 


SAW 
FILTER 


MIXER 
(UP CONVERTER) 


BANDPASS 
FILTER 


CHANNEL 
SELECT 


MODULATED RF 
AUDIO / VIDEO 


Figure 8.10. Typical RF Modulation Implementation for (M) NTSC: BTSC Stereo Audio. 
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Broadcast Video Audio Channel Broadcast Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 
- - - - 40 627.25 631.75 626-632 
- - - - 41 633.25 637.75 632-638 
2 59.25 59.75 54-60 42 639.25 643.75 638-644 
3 61.25 65.75 60-66 43 645.25 649.75 644-650 
4 67.25 71.75 66-72 44 651.25 655.75 650-656 
5 77.25 81.75 76-82 45 657.25 661.75 656-662 
6 83.25 87.75 82-88 46 663.25 667.75 662-668 
7 175.25 179.75 174-180 47 669.25 673.75 668-674 
8 181.25 185.75 180-186 48 675.25 679.75 674-680 
9 187.25 191.75 186-192 49 681.25 685.75 680-686 
10 193.25 197.75 192-198 50 687.25 691.75 686-692 
11 199.25 203.75 198-204 51 693.25 697.75 692-698 
12 205.25 209.75 204-210 52 699.25 703.75 698-704 
13 211.25 215.75 210-216 53 705.25 709.75 704-710 
14 471.25 475.75 470-476 54 711.25 715.75 710-716 
15 477.25 481.75 476-482 55 717.25 721.75 716-722 
16 483.25 487.75 482-488 56 723.25 727.75 722-728 
17 489.25 493.75 488-494 57 729.25 733.75 728-734 
18 495.25 499.75 494-500 58 735.25 739.75 734-740 
19 501.25 505.75 500-506 59 741.25 745.75 740-746 
20 507.25 911.75 506-512 60 747.25 751.75 746-752 
21 513.25 917.75 512-518 61 753.25 757.75 752-758 
22 519.25 523.75 518-524 62 759.25 763.75 758-764 
23 925.25 529.75 524-530 63 765.25 769.75 764-770 
24 931.25 535.75 530-536 64 771.25 775.75 770-776 
25 537.25 941.75 536-542 65 777.25 781.75 776-782 
26 943.25 547.75 542-548 66 783.25 787.75 782-788 
27 549.25 593.75 548-554 67 789.25 793.75 788-794 
28 599.25 559.75 594-560 68 795.25 799.75 794-800 
29 561.25 565.75 560-566 69 801.25 805.75 800-806 
30 567.25 971.75 566-572 
31 573.25 977.75 572-578 
32 579.25 583.75 578-584 
33 585.25 589.75 584-590 
34 591.25 595.75 590-596 
35 597.25 601.75 596-602 
36 603.25 607.75 602-608 
37 609.25 613.75 608-614 
38 615.25 619.75 614-620 
39 621.25 625.75 620-626 


Table 8.1. Analog Broadcast Nominal Frequencies for North and South America. 
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Broadcast Video Audio Channel Broadcast Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 
- - - - 40 633.25 637.75 632-638 
1 91.25 95.75 90-96 41 639.25 643.75 638-644 
2 97.25 101.75 96-102 42 645.25 649.75 644-650 
3 103.25 107.75 102-108 43 651.25 655.75 650-656 
4 171.25 175.75 170-176 44 657.25 661.75 656-662 
5 177.25 181.75 176-182 45 663.25 667.75 662-668 
6 183.25 187.75 182-188 46 669.25 673.75 668-674 
7 189.25 193.75 188-194 47 675.25 679.75 674-680 
8 193.25 197.75 192-198 48 681.25 685.75 680-686 
9 199.25 203.75 198-204 49 687.25 691.75 686-692 
10 205.25 209.75 204-210 50 693.25 697.75 692-698 
11 211.25 215.75 210-216 51 699.25 703.75 698-704 
12 217.25 221.75 216-222 52 705.25 709.75 704-710 
13 471.25 475.75 470-476 53 711.25 715.75 710-716 
14 477.25 481.75 476-482 54 717.25 721.75 716-722 
15 483.25 487.75 482-488 55 723.25 727.75 722-728 
16 489.25 493.75 488-494 56 729.25 733.75 728-734 
17 495.25 499.75 494-500 57 735.25 739.75 734-740 
18 501.25 505.75 500-506 58 741.25 745.75 740-746 
19 507.25 911.75 506-512 59 747.25 791.75 746-752 
20 513.25 917.75 512-518 60 753.25 757.75 752-758 
21 519.25 523.75 518-524 61 759.25 763.75 758-764 
22 925.25 529.75 524-530 62 765.25 769.75 764-770 
23 931.25 939.75 530-536 - - - - 
24 537.25 941.75 536-542 - - - - 
25 943.25 947.75 542-548 - - - - 
26 549.25 993.75 548-554 - - - - 
27 599.29 599.75 594-560 - - - - 
28 561.25 565.75 560-566 - - - - 
29 567.25 971.75 566-572 - - - - 
30 973.25 977.75 972-578 
31 579.25 583.75 578-584 
32 585.25 589.75 584-590 
33 591.25 595.75 590-596 
34 597.25 601.75 596-602 
35 603.25 607.75 602-608 
36 609.25 613.75 608-614 
37 615.25 619.75 614-620 
38 621.25 625.75 620-626 
39 627.25 631.75 626-632 


Table 8.2. Analog Broadcast Nominal Frequencies for Japan. 
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Cable Video Audio Channel Cable Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 

- - - - 40 319.2625 323.7625 318-324 
- - - - 41 325.2625 329.7625 324-330 
2 59.25 59.75 54-60 42 331.2750 335.7750 330-336 
3 61.25 65.75 60-66 43 337.2625 341.7625 336-342 
4 67.25 71.75 66-72 44 343.2625 347.7625 342-348 
5 77.25 81.75 76-82 45 349.2625 353.7625 348-354 
6 83.25 87.75 82-88 46 355.2625 359.7625 354-360 
7 175.25 179.75 174-180 47 361.2625 365.7625 360-366 
8 181.25 185.75 180-186 48 367.2625 371.7625 366-372 
9 187.25 191.75 186-192 49 373.2625 377.7625 372-378 
10 193.25 197.75 192-198 50 379.2625 383.7625 378-384 
11 199.25 203.75 198-204 51 385.2625 389.7625 384-390 
12 205.25 209.75 204-210 52 391.2625 395.7625 390-396 
13 211.25 215.75 210-216 53 397.2625 401.7625 396-402 
14 121.2625 125.7625 120-126 54 403.25 407.75 402-408 
15 127.2625 131.7625 126-132 55 409.25 413.75 408-414 
16 133.2625 137.7625 132-138 56 415.25 419.75 414-420 
17 139.25 143.75 138-144 57 421.25 425.75 420-426 
18 145.25 149.75 144-150 58 427.25 431.75 426-432 
19 151.25 155.75 150-156 59 433.25 437.75 432-438 
20 157.25 161.75 156-162 60 439.25 443.75 438-444 
21 163.25 167.75 162-168 61 445.55 449.75 444-450 
22 169.25 173.75 168-174 62 451.25 455.75 450-456 
23 217.25 221.75 216-222 63 457.25 461.75 456-462 
24 223.25 227.75 222-228 64 463.25 467.75 462-468 
25 229.2625 233.7625 228-234 65 469.25 473.75 468-474 
26 235.2625 239.7625 234-240 66 475.25 479.75 474-480 
27 241.2625 245.7625 240-246 67 481.25 485.75 480-486 
28 247.2625 251.7625 246-252 68 487.25 491.75 486-492 
29 253.2625 257.7625 252-258 69 493.25 497.75 492-498 
30 259.2625 263.7625 258-264 70 499.25 503.75 498-504 
31 265.2625 269.7625 264-270 71 505.25 509.75 504-510 
32 271.2625 275.7625 270-276 72 911.25 515.75 510-516 
33 277.2625 281.7625 276-282 73 917.25 921.75 516-522 
34 283.2625 287.7625 282-288 74 523.25 927.75 522-528 
35 289.2625 293.7625 288-294 75 529.25 533.75 528-534 
36 295.2625 299.7625 294-300 76 939.25 539.75 534-540 
37 301.2625 305.7625 300-306 V7 941.25 945.75 540-546 
38 307.2625 311.7625 306-312 78 947.25 991.75 546-552 
39 313.2625 317.7625 312-318 79 553.25 997.75 552-558 


Table 8.3a. Standard Analog Cable TV Nominal Frequencies for USA. 
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Cable Video Audio Channel Cable Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 
80 559.25 563.75 558-564 120 769.25 773.75 768-774 
81 565.25 569.75 564-570 121 775.25 779.75 774-780 
82 971.25 975.75 570-576 122 781.25 785.75 780-786 
83 977.25 581.75 576-582 123 787.25 791.75 786-792 
84 583.25 987.75 582-588 124 793.25 797.75 792-798 
85 589.25 593.75 588-594 125 799.25 803.75 798-804 
86 595.25 599.75 594-600 126 805.25 809.75 804-810 
87 601.25 605.75 600-606 127 811.25 815.75 810-816 
88 607.25 611.75 606-612 128 817.25 821.75 816-822 
89 613.25 617.75 612-618 129 823.25 827.75 822-828 
90 619.25 623.75 618-624 130 829.25 833.75 828-834 
91 625.25 629.75 624-630 131 835.25 839.75 834-840 
92 631.25 635.75 630-636 132 841.25 845.75 840-846 
93 637.25 641.75 636-642 133 847.25 851.75 846-852 
94 643.25 647.75 642-648 134 853.25 857.75 852-858 
95 91.25 95.75 90-96 135 859.25 863.75 858-864 
96 97.25 101.75 96-102 136 865.25 869.75 864-870 
97 103.25 107.75 102-108 137 871.25 875.75 870-876 
98 109.2750 113.7750 108-114 138 877.25 881.75 876-882 
99 115.2750 119.7750 114-120 139 883.25 887.75 882-888 
100 649.25 653.75 648-654 140 889.25 893.75 888-894 
101 655.25 659.75 654-660 141 895.25 899.75 894-900 
102 661.25 665.75 660-666 142 901.25 905.75 900-906 
103 667.25 671.75 666-672 143 907.25 911.75 906-912 
104 673.25 677.75 672-678 144 913.25 917.75 912-918 
105 679.25 683.75 678-684 145 919.25 923.75 918-924 
106 685.25 689.75 684-690 146 925.25 929.75 924-930 
107 691.25 695.75 690-696 147 931.25 935.75 930-936 
108 697.25 701.75 696-702 148 937.25 941.75 936-942 
109 703.25 707.75 702-708 149 943.25 947.75 942-948 
110 709.25 713.75 708-714 150 949.25 953.75 948-954 
111 715.25 719.75 714-720 151 955.25 959.75 954-960 
112 721.25 725.75 720-726 152 961.25 965.75 960-966 
113 727.25 731.75 726-732 153 967.25 971.75 966-972 
114 733.25 737.75 732-738 154 973.25 977.75 972-978 
115 739.25 743.75 738-744 155 979.25 983.75 978-984 
116 745.25 749.75 744-750 156 985.25 989.75 984-990 
117 751.25 759.75 750-756 157 991.25 995.75 990-996 
118 757.25 761.75 756-762 158 997.25 1001.75 996-1002 
119 763.25 767.75 762-768 - - - - 
T7 7-13 T11 31-37 
T8 13-19 T13 37-43 
T9 19-25 T13 43-49 
T10 25-31 714 46-55 


Table 8.3b. Standard Analog Cable TV Nominal Frequencies for USA. T channels are 
reverse (return) channels for two-way applications. 
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Video Audio Video Audio 
Cable ; Cable ; . 

Channel Carrier Carrier Channel Carrier Carrier 

(MHz) (MHz) (MHz) (MHz) 
- - - 40 319.2625 323.7625 
1 73.2625 77.7625 41 325.2625 329.7625 
2 59.2625 59.7625 42 331.2750 335.7750 
3 61.2625 65.7625 43 337.2625 341.7625 
4 67.2625 71.7625 44 343.2625 347.7625 
5 79.2625 83.7625 45 349.2625 353.7625 
6 85.2625 89.7625 46 355.2625 359.7625 
7 175.2625 179.7625 47 361.2625 365.7625 
8 181.2625 185.7625 48 367.2625 371.7625 
9 187.2625 191.7625 49 373.2625 377.7625 
10 193.2625 197.7625 50 379.2625 383.7625 
11 199.2625 203.7625 51 385.2625 389.7625 
12 205.2625 209.7625 52 391.2625 395.7625 
13 211.2625 215.7625 53 397.2625 401.7625 
14 121.2625 125.7625 54 403.2625 407.7625 
15 127.2625 131.7625 55 409.2625 413.7625 
16 133.2625 137.7625 56 415.2625 419.7625 
17 139.2625 143.7625 57 421.2625 425.7625 
18 145.2625 149.7625 58 427.2625 431.7625 
19 151.2625 155.7625 59 433.2625 437.7625 
20 157.2625 161.7625 60 439.2625 443.7625 
21 163.2625 167.7625 61 445.2625 449.7625 
22 169.2625 173.7625 62 451.2625 455.7625 
23 217.2625 221.7625 63 457.2625 461.7625 
24 223.2625 227.7625 64 463.2625 467.7625 
25 229.2625 233.7625 65 469.2625 473.7625 
26 235.2625 239.7625 66 475.2625 479.7625 
27 241.2625 245.7625 67 481.2625 485.7625 
28 247.2625 251.7625 68 487.2625 491.7625 
29 253.2625 257.7625 69 493.2625 497.7625 
30 259.2625 263.7625 70 499.2625 503.7625 
31 265.2625 269.7625 71 505.2625 509.7625 
32 271.2625 275.7625 72 511.2625 915.7625 
33 277.2625 281.7625 73 917.2625 921.7625 
34 283.2625 287.7625 74 523.2625 527.7625 
35 289.2625 293.7625 75 529.2625 533.7625 
36 295.2625 299.7625 76 935.2625 539.7625 
37 301.2625 305.7625 17 941.2625 945.7625 
38 307.2625 311.7625 78 947.2625 991.7625 
39 313.2625 317.7625 719 593.2625 597.7625 


Table 8.3c. Analog Cable TV Nominal Frequencies for USA: Incrementally 
Related Carrier (IRC) Systems. 
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Cable Video Audio Cable Video Audio 
Channel Carrier Carrier Channel Carrier Carrier 
(MHz) (MHz) (MHz) (MHz) 
80 599.2625 963.7625 120 769.2625 773.7625 
81 565.2625 569.7625 121 775.2625 779.7625 
82 971.2625 975.7625 122 781.2625 785.7625 
83 977.2625 981.7625 123 787.2625 791.7625 
84 983.2625 587.7625 124 793.2625 797.7625 
85 589.2625 593.7625 125 799.2625 803.7625 
86 595.2625 599.7625 126 805.2625 809.7625 
87 601.2625 605.7625 127 811.2625 815.7625 
88 607.2625 611.7625 128 817.2625 821.7625 
89 613.2625 617.7625 129 823.2625 827.7625 
90 619.2625 623.7625 130 829.2625 833.7625 
91 625.2625 629.7625 131 835.2625 839.7625 
92 631.2625 635.7625 132 841.2625 845.7625 
93 637.2625 641.7625 133 847.2625 851.7625 
94 643.2625 647.7625 134 853.2625 857.7625 
95 91.2625 95.7625 135 859.2625 863.7625 
96 97.2625 101.7625 136 865.2625 869.7625 
97 103.2625 107.7625 137 871.2625 875.7625 
98 109.2750 113.7750 138 877.2625 881.7625 
99 115.2625 119.7625 139 883.2625 887.7625 
100 649.2625 653.7625 140 889.2625 893.7625 
101 655.2625 659.7625 141 895.2625 899.7625 
102 661.2625 665.7625 142 901.2625 905.7625 
103 667.2625 671.7625 143 907.2625 911.7625 
104 673.2625 677.7625 144 913.2625 917.7625 
105 679.2625 683.7625 145 919.2625 923.7625 
106 685.2625 689.7625 146 925.2625 929.7625 
107 691.2625 695.7625 147 931.2625 935.7625 
108 697.2625 701.7625 148 937.2625 941.7625 
109 703.2625 707.7625 149 943.2625 947.7625 
110 709.2625 713.7625 150 949.2625 953.7625 
111 715.2625 719.7625 151 955.2625 959.7625 
112 721.2625 725.7625 152 961.2625 965.7625 
113 727.2625 731.7625 153 967.2625 971.7625 
114 733.2625 737.7625 154 973.2625 977.7625 
115 739.2625 743.7625 155 979.2625 983.7625 
116 745.2625 749.7625 156 985.2625 989.7625 
117 751.2625 759.7625 157 991.2625 995.7625 
118 757.2625 761.7625 158 997.2625 1001.7625 
119 763.2625 767.7625 - - - 


Table 8.3d. Analog Cable TV Nominal Frequencies for USA: Incrementally 
Related Carrier (IRC) Systems. 
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Video Audio Video Audio 
Cable ; Cable ; . 

Channel Carrier Carrier Channel Carrier Carrier 

(MHz) (MHz) (MHz) (MHz) 
- - - 40 318.0159 322.5159 
1 72.0036 76.5036 41 324.0162 328.5162 
2 94.0027 58.9027 42 330.0165 334.5165 
3 60.0030 64.5030 43 336.0168 340.5168 
4 66.0033 70.5030 44 342.0168 346.5168 
5 72.0036 82.5039 45 348.0168 352.5168 
6 78.0039 88.5042 46 354.0168 358.5168 
7 174.0087 178.5087 47 360.0168 364.5168 
8 180.0090 184.5090 48 366.0168 370.5168 
9 186.0093 190.5093 49 372.0168 376.5168 
10 192.0096 196.5096 50 378.0168 382.5168 
11 198.0099 202.5099 51 384.0168 388.5168 
12 204.0102 208.5102 52 390.0168 394.5168 
13 210.0105 214.5105 53 396.0168 400.5168 
14 120.0060 124.5060 54 402.0201 406.5201 
15 126.0063 130.5063 55 408.0204 412.5204 
16 132.0066 136.5066 56 414.0207 418.5207 
17 138.0069 142.5069 57 420.0210 424.5210 
18 144.0072 148.5072 58 426.0213 430.5213 
19 150.0075 154.5075 59 436.5216 436.5216 
20 156.0078 160.5078 60 438.0219 442.5219 
21 162.0081 166.5081 61 444.0222 448.5222 
22 168.0084 172.5084 62 450.0225 454.5225 
23 216.0108 220.5108 63 456.0228 460.5228 
24 222.0111 226.5111 64 462.0231 466.5231 
25 228.0114 232.5114 65 468.0234 472.5234 
26 234.0117 238.5117 66 474.0237 478.5237 
27 240.0120 244.5120 67 480.0240 484.5240 
28 246.0123 250.5123 68 486.0243 490.5243 
29 252.0126 256.5126 69 492.0246 496.5246 
30 258.0129 262.5129 70 498.0249 502.5249 
31 264.0132 268.5132 71 504.0252 508.5252 
32 270.0135 274.5135 72 510.0255 914.5255 
33 276.0138 280.5138 73 516.0258 520.5258 
34 282.0141 286.5141 74 522.0261 526.5261 
35 288.0144 292.5144 75 528.0264 932.5264 
36 294.0147 298.5147 76 534.0267 938.5267 
37 300.0150 304.5150 17 540.0270 544.5270 
38 306.0153 310.5153 78 546.0273 590.5273 
39 312.0156 316.5156 719 592.0276 596.5276 


Table 8.3e. Analog Cable TV Nominal Frequencies for USA: Harmonically 
Related Carrier (HRC) systems. 
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Cable Video Audio Cable Video Audio 
Channel Carrier Carrier Channel Carrier Carrier 

(MHz) (MHz) (MHz) (MHz) 

80 598.0279 962.5279 120 768.0384 772.5384 
81 564.0282 568.5282 121 774.0387 778.5387 
82 570.0285 974.5285 122 780.0390 784.5390 
83 576.0288 580.5288 123 786.0393 790.5393 
84 982.0291 986.5291 124 792.0396 796.5396 
85 588.0294 592.5294 125 798.0399 802.5399 
86 594.0297 598.5297 126 804.0402 808.5402 
87 600.0300 604.5300 127 810.0405 814.5405 
88 606.0303 610.5303 128 816.0408 820.5408 
89 612.0306 616.5306 129 822.0411 826.5411 
90 618.0309 622.5309 130 828.0414 832.5414 
91 624.0312 628.5312 131 834.0417 838.5417 
92 630.0315 634.5315 132 840.0420 844.5420 
93 636.0318 640.5318 133 846.0423 850.5423 
94 642.0321 646.5321 134 852.0426 856.5426 
95 90.0045 94.5045 135 858.0429 862.5429 
96 96.0048 100.5048 136 864.0432 868.5432 
97 102.0051 106.5051 137 870.0435 874.5435 
98 - - 138 876.0438 880.5438 
99 - - 139 882.0441 888.5441 
100 648.0324 652.5324 140 888.0444 892.5444 
101 654.0327 658.5327 141 894.0447 898.5447 
102 660.0330 664.5330 142 900.0450 904.5450 
103 666.0333 670.5333 143 906.0453 910.5453 
104 672.0336 676.5336 144 912.0456 916.5456 
105 678.0339 682.5339 145 918.0459 922.5459 
106 684.0342 688.5342 146 924.0462 928.5462 
107 690.0345 694.5345 147 930.0465 934.5465 
108 696.0348 700.5348 148 936.0468 940.5468 
109 702.0351 706.5351 149 942.0471 946.5471 
110 708.0354 712.5354 150 948.0474 952.5474 
111 714.0357 718.5357 151 954.0477 958.5477 
112 720.0360 724.5360 152 960.0480 964.5480 
113 726.0363 730.5363 153 966.0483 970.5483 
114 732.0366 736.5366 154 972.0486 976.5486 
115 738.0369 742.5369 155 978.0489 982.5489 
116 744.0372 748.5372 156 984.0492 988.5492 
117 750.0375 754.5375 157 990.0495 994.5495 
118 756.0378 760.5378 158 996.0498 1000.5498 

119 762.0381 766.5381 - - - 


Table 8.3f. Analog Cable TV Nominal Frequencies for USA: Harmonically 
Related Carrier (HRC) systems. 
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Cable Video Audio Channel Cable Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 
- - - - 40 325.25 329.75 324-330 
- - - - 41 331.25 335.75 330-336 
- - - - 42 337.25 341.75 336-342 
13 109.25 113.75 108-114 46 343.25 347.75 342-348 
14 115.25 119.75 114-120 44 349.25 353.75 348-354 
15 121.25 125.75 120-126 45 355.25 359.75 354-360 
16 127.25 131.75 126-132 46 361.25 365.75 360-366 
17 133.25 137.75 132-138 47 367.25 371.75 366-372 
18 139.25 143.75 138-144 48 373.25 377.75 372-378 
19 145.25 149.75 144-150 49 379.25 383.75 378-384 
20 151.25 155.75 150-156 50 385.25 389.75 384-390 
21 157.25 161.75 156-162 51 391.25 395.75 390-396 
22 165.25 169.75 164-170 52 397.25 401.75 396-402 
23 223.25 227.75 222-228 53 403.25 407.75 402-408 
24 231.25 235.75 230-236 54 409.25 413.75 408-414 
25 237.25 241.75 236-242 55 415.25 419.75 414-420 
26 243.25 247.75 242-248 56 421.25 425.75 420-426 
27 249.25 253.75 248-254 57 427.25 431.75 426-432 
28 253.25 297.75 252-258 58 433.25 437.75 432-438 
29 259.25 263.75 258-264 59 439.25 443.75 438-444 
30 265.25 269.75 264-270 60 445.25 449.75 444-450 
31 271.25 275.75 270-276 61 451.25 455.75 450-456 
32 277.25 281.75 276-282 62 457.25 461.75 456-462 
33 283.25 287.75 282-288 63 463.25 467.75 462-468 
34 289.25 293.75 288-294 - - - - 
35 295.25 299.75 294-300 - - - - 
36 301.25 305.75 300-306 - - - - 
37 307.25 311.75 306-312 - - - - 
38 313.25 317.75 312-318 - - - - 
39 319.25 323.75 318-324 - - - - 


Table 8.4. Analog Cable TV Nominal Frequencies for Japan. 
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The following countries use the (M) NTSC 


standard. 


Antigua 
Aruba 
Bahamas 
Barbados 
Belize 
Bermuda 
Bolivia 
Canada 
Chile 
Colombia 
Costa Rica 
Cuba 
Curacao 
Dominican Republic 
Ecuador 

El Salvador 
Guam 
Guatemala 
Honduras 
Jamaica 


Japan (NTSC-J) 

Korea, South 

Mexico 

Montserrat 

Myanmar 

Nicaragua 

Panama 

Peru 

Philippines 

Puerto Rico 

St. Kitts and Nevis 

Samoa 

Suriname 

Taiwan 

Trinidad/Tobago 

United States of 
America 

Venezuela 

Virgin Islands 


Luminance Equation Derivation 


The equation for generating luminance from 
RGB is determined by the chromaticities of the 
three primary colors used by the receiver and 


what color white actually is. 


The chromaticities of the RGB primaries 
and reference white (CIE illuminate C) were 
specified in the 1953 NTSC standard to be: 


R: x,=0.67 y,=0.33 z,=0.00 
G: Xg=0.21 yg=0.71 Zz, =0.08 


B: X,=0.14 yp = 


white: Xy, = 0.3101 
Zw = 0.3737 


0.08 2, = 0.78 
Yw = 0.3162 


where x and y are the specified CIE 1931 chro- 
maticity coordinates; z is calculated by know- 
ing thatx+y+z=1. 

Luminance is calculated as a weighted sum 
of RGB, with the weights representing the 
actual contributions of each of the RGB prima- 
ries in generating the luminance of reference 
white. We find the linear combination of RGB 
that gives reference white by solving the equa- 
tion: 


x, Xg xy K, XI 
Vie D7) AG =| I 
by By &y) [Ky 2y/Vyy 


Rearranging to solve for K,, Kg, and Ky yields: 


-1 


K, Kyl Iy| [Xr Xp Xp 
K, - 1 Jy Vg Yo 
K, Zw/Iw| |2y 2g 2 


Substituting the known values gives us the 
solution for K,, Kg, and K;;: 


-1 
K, 0.3101/0.3162| |0.67 0.21 0.14 
K,| = 1 0.33 0.71 0.08 


K, 0.3737/0.3162) {0.00 0.08 0.78 


0.9807} | 1.730 -—0.482 -0.261 
= 1 —0.814 1.652 -0.023 
1.1818} | 0.083 -0.169 1.284 


0.906 
=|0.827 
1.430 
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Y is defined to be 


Y= (K,y,)R’ + (Kgyg)G’ + (Kpyp) B’ 
= (0.906) (0.33)R’ + (0.827) (0.71) G’ 
+ (1.430) (0.08) B’ 


or 


Y = 0.299R’ + 0.587G’ + 0.114B’ 


Modern receivers use a different set of 
RGB phosphors, resulting in slightly different 
chromaticities of the RGB primaries and refer- 
ence white (CIE illuminate D¢;): 


R: x,= 0.630 y,=0.340 z= 0.030 
G: x,=0310 yg=0.595 z= 0.095 
B: x,=0.155 y,=0.070 2, = 0.775 
white: X_=0.3127 yy = 0.3290 

2, = 0.3583 


where x and y are the specified CIE 1931 chro- 
maticity coordinates; z is calculated by know- 
ing that x + y + z=1. Once again, substituting 
the known values gives us the solution for K,, 
Kg, and K;;: 


r - 
K, 0.3127/0.3290] |0.630 0.310 0.155 
K,| = 1 0.340 0.595 0.070 
K, | 0.3583/0.3290} |0.030 0.095 0.775 


10,6243 
= 11.1770 
1.2362 


Since Y is defined to be 


Y = (K,y,)R’ + Kgyg)G’ + (Kpyp) B’ 
= (0.6243) (0.340)R’ + (1.1770) (0.595) G’ 
+ (1.2362) (0.070)B’ 


this results in: 


Y = 0.212R’ + 0.700G’ + 0.086B’ 


However, the standard Y = 0.299R’ + 
0.587G’ + 0.114B’ equation is still used. Adjust- 
ments are in the receiver to minimize color 
errors. 


PAL Overview 


Europe delayed adopting a color television 
standard, evaluating various systems between 
1953 and 1967 that were compatible with their 
625-line, 50-field-per-second, 2:1 interlaced 
monochrome standard. The NTSC specifica- 
tion was modified to overcome the high order 
of phase and amplitude integrity required dur- 
ing broadcast to avoid color distortion. The 
Phase Alternation Line (PAL) system imple- 
ments a line-by-line reversal of the phase of 
one of the color components, originally relying 
on the eye to average any color distortions to 
the correct color. Broadcasting began in 1967 
in Germany and the United Kingdom, with 
each using a slightly different variant of the 
PAL system. 
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Luminance Information 


The monochrome luminance (Y) signal is 
derived from R’G’B’: 


Y = 0.299R’ + 0.587G' + 0.114B’ 


As with NTSC, the luminance signal occu- 
pies the entire video bandwidth. PAL has sev- 
eral variations, depending on the video 
bandwidth and placement of the audio subcar- 
rier. The composite video signal has a band- 
width of 4.2, 5.0, 5.5, or 6.0 MHz, depending on 
the specific PAL standard. 


Color Information 


To transmit color information, U and V are 
used: 


U = 0.492(B’ -Y) 
V = 0.877(R’ - Y) 
U and V have a typical bandwidth of 1.3 MHz. 


Color Modulation 


As in the NTSC system, U and V are used to 
modulate the color subcarrier using two bal- 
anced modulators operating in phase quadra- 
ture: one modulator is driven by the subcarrier 
at sine phase, the other modulator is driven by 
the subcarrier at cosine phase. The outputs of 
the modulators are added together to form the 
modulated chrominance signal: 


C=U sin at + Vcos at 


@ = 2nF sc 


Foc = 4.43361875 MHz (+ 5 Hz) 
for (B, D, G, H, I, N) PAL 


Fsc = 3.58205625 MHz (+ 5 Hz) for 
(No) PAL 


Fg¢ = 3.57561149 MHz (+ 10 Hz) for 
(M) PAL 


In PAL, the phase of V is reversed every 
other line. V was chosen for the reversal pro- 
cess since it has a lower gain factor than U and 
therefore is less susceptible to a one-half Fy 
switching rate imbalance. The result of alter- 
nating the V phase at the line rate is that any 
color subcarrier phase errors produce comple- 
mentary errors, allowing line-to-line averaging 
at the receiver to cancel the errors and gener- 
ate the correct hue with slightly reduced satu- 
ration. This technique requires the PAL 
receiver to be able to determine the correct V 
phase. This is done using a technique known 
as AB sync, PAL sync, PAL Switch, or “swing- 
ing burst,” consisting of alternating the phase 
of the color burst by +45° at the line rate. The 
UV vector diagrams are shown in Figures 8.11 
and 8.12. 

“Simple” PAL decoders rely on the eye to 
average the line-by-line hue errors. “Standard” 
PAL decoders use a 1-H delay line to separate 
U from V in an averaging process. Both imple- 
mentations have the problem of Hanover bars, 
in which pairs of adjacent lines have a real and 
complementary hue error. Chrominance verti- 
cal resolution is reduced as a result of the line 
averaging process. 


Composite Video Generation 


The modulated chrominance is added to the 
luminance information along with appropriate 
horizontal and vertical sync signals, blanking 
signals, and color burst signals, to generate the 
composite color video waveform shown in Fig- 
ure 8.13. 


composite PAL = Y + U sin at 
+Vcos ot+ timing 
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IRE SCALE UNITS 


MAGENTA 
61° 


YELLOW 
167° 


Figure 8.11. UV Vector Diagram for 75% Color Bars. Line [n], 
PAL Switch = zero. 


IRE SCALE UNITS 


+V 
90° 
CYAN 
77 


YELLOW 
193° 


MAGENTA 
300° 


Figure 8.12. UV Vector Diagram for 75% Color Bars. Line [n 
+ 1], PAL Switch = one. 
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WHITE LEVEL 
100 IRE 
COLOR BURST LJ 
(10 + 1 CYCLES) 
21.43 IRE 
| BLACK / BLANK LEVEL 
21.43 IRE 
43 IRE 
' SYNC LEVEL 
COLOR 
LUMINANCE LEVEL Sar RATION 
PHASE = HUE 


Figure 8.13. (B, D, G, H, |, Nc) PAL Composite 
Video Signal for 75% Color Bars. 
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The bandwidth of the resulting composite 
video signal is shown in Figure 8.14. 

Like NTSC, the luminance components are 
spaced at Fy intervals due to horizontal blank- 
ing. Since the V component is switched sym- 
metrically at one-half the line rate, only odd 
harmonics are generated, resulting in V com- 
ponents that are spaced at intervals of Fy. The 
V components are spaced at half-line intervals 
from the U components, which also have Fy 
spacing. If the subcarrier had a half-line offset 
like NTSC uses, the U components would be 
perfectly interleaved, but the V components 
would coincide with the Y components and 
thus not be interleaved, creating vertical sta- 
tionary dot patterns. For this reason, PAL uses 
a 1/4 line offset for the subcarrier frequency: 


for (B, D, G, H, I, N) PAL 


Foc = (909/4) Fy 
for (M) PAL 


Fsc¢ = ((917/4) a (1/625)) Fy 
for (Nc) PAL 


The additional (1/625) Fy factor (equal to 
25 Hz) provides motion to the color dot pat- 
tern, reducing its visibility. Figure 8.15 illus- 
trates the resulting frequency interleaving. 
Eight complete fields are required to repeat a 
specific sample position, as shown in Figures 
8.16 and 8.17. 


PAL Variations 


There is a variation of PAL, “noninterlaced 
PAL,” shown in Figure 8.18. It is a 312-line, 50 
frames-per-second version of PAL common 
among video games and on-screen displays. 
This format is identical to standard PAL, 
except that there are 312 lines per frame. 


The most common PAL standards are 
shown in Figure 8.19. 


RF Modulation 


Figures 8.20 and 8.21 illustrate the process of 
converting baseband (G) PAL composite video 
to a RF (radio frequency) signal. The process 
for the other PAL standards is similar, except 
primarily the different video bandwidths and 
subcarrier frequencies. 

Figure 8.20a shows the frequency spec- 
trum of a (G) PAL baseband composite video 
signal. It is similar to Figure 8.14. However, Fig- 
ure 8.14 only shows the upper sideband for 
simplicity. The “video carrier” notation at 0 
MHz serves only as a reference point for com- 
parison with Figure 8.20b. 

Figure 8.20b shows the audio/video signal 
as it resides within an 8-MHz channel. The 
video signal has been lowpass filtered, most of 
the lower sideband has been removed, and 
audio information has been added. Note that 
(H) and (1) PAL have a vestigial sideband of 
1.25 MHz, rather than 0.75 MHz. 

Figure 8.20c details the information 
present on the audio subcarrier for analog ste- 
reo operation. 

As shown in Figure 8.21, back porch clamp- 
ing of the analog video signal ensures that the 
back porch level is constant, regardless of 
changes in the average picture level. The video 
signal is then lowpass filtered to 5.0 MHz and 
drives the AM (amplitude modulation) video 
modulator. The sync level corresponds to 100% 
modulation; the blanking and white modulation 
levels are dependent on the specific version of 
PAL: 
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CHROMINANCE 
AMPLITUDE SUBCARRIER 
4 4 
Y U U 
+V +V 
oe, FREQUENCY 
T T T T T T (MHz) 
0.0 1.0 2.0 3.0 4.0 4.43 5.0 5.5 
(I) PAL 
CHROMINANCE 
AMPLITUDE SUBCARRIER 
4 4 
ay U U 
+V +V 
FREQUENCY 
T T T T T (MHZ) 
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(B, G,H) PAL 


Figure 8.14. Video Bandwidths of Some PAL Systems. 


Figure 8.15. Luma and Chroma Frequency Interleave Principle. 
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3LANKING FIELD TWO 
NTERVALS 
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BURST PHASE = REFERENCE PHASE = 135° RELATIVE TO U 
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+V COMPONENT 


—V COMPONENT 


| BURST PHASE = REFERENCE PHASE + 90° = 225° RELATIVE TO U 


PAL SWITCH = 1, 


Figure 8.16a. Eight-field (B, D, G, H, |, Nc) PAL Sequence and Burst Blanking. 
See Figure 8.5 for equalization and serration pulse details. 
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| BURST PHASE = REFERENCE PHASE + 90° = 225° RELATIVE TO U 


Figure 8.16b. Eight-field (B, D, G, H, I, Nc) PAL Sequence and Burst Blanking. 
See Figure 8.5 for equalization and serration pulse details. 
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520 521 522 523 524 525 1 2 5 
ANALOG 
ry | ry 
FIELD 4/8 
\ 
VA : IX. Nh ' 
258 | 259 260 261 262 263 264 265 266 267 268 269 270 271 272 


4 BURST PHASE = REFERENCE PHASE = 135° RELATIVE TO U 
PAL SWITCH = 0, +V COMPONENT 


BURST PHASE = REFERENCE PHASE + 90° = 225° RELATIVE TO U 
PAL SWITCH = 1, —V COMPONENT 


Figure 8.17. Eight-field (M) PAL Sequence and Burst Blanking. 


See Figure 8.5 for equalization and serration pulse details. 
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START 


308 | 309 310 311 312 1 2 


| BURST PHASE = REFERENCE PHASE = 135° RELATIVE TO U 


PAL SWITCH = 0, +V COMPONENT 


| BURST PHASE = REFERENCE PHASE + 90° = 225° RELATIVE TO U 


PAL SWITCH = 1, —V COMPONENT 


Figure 8.18. Noninterlaced PAL Frame Sequence. 


blanking level (% modulation) 


B,G 79% 
D,H,M,N 75% 
I 76% 


white level (% modulation) 


B,G,H,M,N 10% 
D 10% 
I 20% 


Note that PAL systems use a variety of 
video and audio IF frequencies (values in 
MHz): 


video audio 


B,G = 38.900 33.400 

B 36.875 31.375 — Australia 
D 37.000 30.500 China 
D 38.900 32.400  OIRT 

I 38.900 32.900 

I 39.500 33.500 U.K. 
M,N 45.750 41.250 


At this point, audio information is added on 
the audio subcarrier. A monaural L+R audio 
signal is processed as shown in Figure 8.21 
and drives the FM (frequency modulation) 
modulator. The output of the FM modulator is 
added to the IF video signal. 

The SAW filter, used as a vestigial side- 
band filter, provides filtering of the IF signal. 
The mixer, or up converter, mixes the IF signal 
with the desired broadcast frequency. Both 
sum and difference frequencies are generated 
by the mixing process, so the difference signal 
is extracted by using a bandpass filter. 


Stereo Audio (Analog) 
The implementation of analog stereo audio 
(ITU-R BS.707), also known as Zweiton or A2, 
is shown in Figure 8.21. The L+R information 
is transmitted on a FM subcarrier. The R infor- 
mation, or a second L+R audio signal, is trans- 
mitted on asecond FM subcarrier at +15.5Fy. 
If stereo or dual mono signals are present, 
the FM subcarrier at +15.5Fy is amplitude- 
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QUADRATURE MODULATED SUBCARRIER 
PHASE = HUE 
AMPLITUDE = SATURATION 
LINE ALTERNATION OF V COMPONENT 


a teen | een ee |e 


sd "B, B1, G, H" "M" 

LINE/ FIELD = 625/ 50 LINE/ FIELD = 625/ 50 LINE/ FIELD = 525/ 59.94 

FH = 15.625 KHZ FH = 15.625 KHZ FH = 15.734 KHZ 

FV = 50 HZ FV = 50 HZ FV = 59.94 HZ 

FSC = 4.43361875 MHZ FSC = 4.43361875 MHZ FSC = 3.57561149 MHZ 
BLANKING SETUP = 0 IRE BLANKING SETUP = 0 IRE BLANKING SETUP = 7.5 IRE 
VIDEO BANDWIDTH = 5.5 MHZ VIDEO BANDWIDTH = 5.0 MHZ VIDEO BANDWIDTH = 4.2 MHZ 
AUDIO CARRIER = 5.9996 MHZ AUDIO CARRIER = 5.5 MHZ AUDIO CARRIER = 4.5 MHZ 
CHANNEL BANDWIDTH = 8 MHZ CHANNEL BANDWIDTH: CHANNEL BANDWIDTH = 6 MHZ 

B = 7 MHZ 


B1,G,H = 8 MHZ 


"D" "N" "Nc" 

LINE/ FIELD = 625/ 50 LINE/ FIELD = 625/ 50 LINE/ FIELD = 625/ 50 

FH = 15.625 KHZ FH = 15.625 KHZ FH = 15.625 KHZ 

FV = 50 HZ FV = 50 HZ FV = 50 HZ 

FSC = 4.43361875 MHZ FSC = 4.43361875 MHZ FSC = 3.58205625 MHZ 
BLANKING SETUP = 0 IRE BLANKING SETUP = 7.5 IRE BLANKING SETUP = 0 IRE 
VIDEO BANDWIDTH = 6.0 MHZ VIDEO BANDWIDTH = 5.0 MHZ VIDEO BANDWIDTH = 4.2 MHZ 
AUDIO CARRIER = 6.5 MHZ AUDIO CARRIER = 5.5 MHZ AUDIO CARRIER = 4.5 MHZ 
CHANNEL BANDWIDTH = 8 MHZ CHANNEL BANDWIDTH = 6 MHZ CHANNEL BANDWIDTH = 6 MHZ 


Figure 8.19. Common PAL Systems. 
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VIDEO 
CARRIER 
CHROMINANCE CHROMINANCE 
SUBCARRIER SUBCARRIER 
| | | = FREQUENCY (MHZ) 
55 -5.0 -4.43 -1.0 0.0 1.0 443 5.0 5.5 
(A) 
VIDEO 
CARRIER 
CHROMINANCE 
0.75 MHZ SUBCARRIER 
VESTIGIAL 
SIDEBAND AUDIO 
CARRIER 
TI T = FREQUENCY (MHZ) 
-4.0 -3.0 -0.75 0.0 1.0 443 5.0 5.5 
8 MHZ CHANNEL 
Le ral 
=1.25 6.75 
(B) 
AUDIO 
CARRIER FH = 15,625 HZ 
\ 
L+R R 
(FM) (FM) 
1 | | > FREQUENCY 
-50 KHZ 0.0 50 KHZ assFH 153FH  ys5FH 
50 KHZ +50 KHZ 
(C) 


Figure 8.20. Transmission Channel for (G) PAL. (a) Frequency spectrum of baseband composite 


video. (b) Frequency spectrum of typical channel including audio information. (c) Detailed 
frequency spectrum of Zweiton analog stereo audio information. 
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modulated with a 54.6875 kHz (3.5Fy) subcar- 
rier. This 54.6875 kHz subcarrier is 50% ampli- 
tude-modulated at 117.5 Hz (Fy / 133) to 
indicate stereo audio or 274.1 Hz (Fy / 57) to 
indicate dual mono audio. 

Countries that use this system include 
Australia, Austria, China, Germany, Italy, 
Malaysia, Netherlands, Slovenia, and Switzer- 
land. 


Stereo Audio (Digital) 

The implementation of digital stereo audio 
uses NICAM 728 (Near Instantaneous Com- 
panded Audio Multiplex), discussed within 
BS.707 and ETSI EN 300 163. It was developed 
by the BBC and IBA to increase sound quality, 
provide multiple channels of digital sound or 


data, and be more resistant to transmission 
interference. 

The subcarrier resides either 5.85 MHz 
above the video carrier for (B, D, G, H) PAL 
and (L) SECAM systems or 6.552 MHz above 
the video carrier for (I) PAL systems. 

Countries that use NICAM 728 include 
Belgium, China, Denmark, Finland, France, 
Hungary, New Zealand, Norway, Singapore, 
South Africa, Spain, Sweden, and the United 
Kingdom. 

NICAM 728 is a digital system that uses a 
32-kHz sampling rate and 14-bit resolution. A 
bit rate of 728 kbps is used, giving it the name 
NICAM 728. Data is transmitted in frames, 
with each frame containing 1 ms of audio. As 
shown in Figure 8.22, each frame consists of: 


117.5 HZ 
274.1 HZ 


| 


STEREO 3.5FH 


PILOT SIGNAL 


AM 


MODULATOR 


50 pS 
PRE-EMPHASIS 


FM AM 
—| 
MODULATOR 


MODULATOR 


33.4 MHZ — 15.5FH 
IF AUDIO CARRIER 


50 pS 
PRE-EMPHASIS 


AUDIO RIGHT 


AUDIO LEFT 


i -©) 
MODULATOR 


40-15,000 HZ BPF 


BACK 
PORCH 
CLAMP 


(G) PAL 
COMPOSITE ———=| 
VIDEO 


5.0 MHZ AM 


38.9 MHZ 
IF VIDEO CARRIER 


33.4 MHZ 
IF AUDIO CARRIER 


z SAW 
— 
LPF MODULATOR 


BANDPASS 
FILTER 


MIXER 
(UP CONVERTER) 


MODULATED RF 


— 


AUDIO / VIDEO 


— 
FILTER / 


33.15-40.15 MHZ 


BANDWIDTH CHANNEL 
SELECT 


Figure 8.21. Typical RF Modulation Implementation for (G) PAL: Zweiton Stereo Audio. 


PAL Overview 275 


8-bit frame alignment word (01001110) 
5 control bits (CO-C4) 

11 undefined bits (ADO-AD10) 

704 audio data bits (A000-A703) 


CO is a “1” for eight successive frames and 
a “0” for the next eight frames, defining a 16- 
frame sequence. C1-C3 specify the format 
transmitted: “000” = one stereo signal with the 
left channel being odd-numbered samples and 
the right channel being even-numbered sam- 
ples, “010” = two independent mono channels 
transmitted in alternate frames, “100” = one 
mono channel and one 352 kbps data channel 
transmitted in alternate frames, “110” = one 
704 kbps data channel. C4 is a “1” if the analog 
sound is the same as the digital sound. 


Stereo Audio Encoding 

The thirty-two 14-bit samples (1 ms of audio, 
2’s complement format) per channel are pre- 
emphasized to the ITU-T J.17 curve. 

The largest positive or negative sample of 
the 32 is used to determine which 10 bits of all 
32 samples to transmit. Three range bits per 
channel (RO;, R1,, R2,, and ROp, Rlp, R2p) are 
used to indicate the scaling factor. D13 is the 
sign bit (“O” = positive). 


D13—D0 R2—-RO Bits Used 
O1xxxXxXxXXXXXXXX LAT D13, D12—-D4 
OO1xXxXxXxXxXXxXXXXX 110 D13, D11—-D3 
OOO1LXxXxXxXXxXXXXX 101 D13, D10—-D2 
OO0OO001xXxXxXxXxXxXxXxXX O11 D13, D9-D1 
O00001xXxXxXxxXxXxXxX 101 D13, D8—DO 
O0000001LxXxxxxXxx 010 D13, D8—DO 
OO000000xxxxxXxx 00x D13, D8—DO 
1111111xxxxxxx 00x D13, D8—DO 
1111110xxxxxxx 010 D13, D8—DO 
111110xxxxxxxx 100 D13, D8—DO 
11110xxxxxxxxx 011 Di3, DI=-DL 
1110xxxxXxXxXxXxXxXX 101 D13, D10—-—D2 
110xxxxXxXXxXXXXX 110 D13, D11—-D3 
1LOXxXxXXXXXXXXXK La D13, D12—D4 


A parity bit for the six MSBs of each sam- 
ple is added, resulting in each sample being 11 
bits. The 64 samples are interleaved, generat- 
ing LO, RO, L1, R1, L2, R2, ... L31, R31, and 
numbered 0-63. 

The parity bits are used to convey to the 
decoder what scaling factor was used for each 
channel (“signalling-in-parity”). 


If R2; = “0,” even parity for samples 
0, 6, 12, 18, ... 48 is used. If R2;, = “1,” 
odd parity is used. 


If R2p = “0,” even parity for samples 
1, 7, 13, 19, ... 49 is used. If R2p = “1,” 
odd parity is used. 


If Rl, = “0,” even parity for samples 
2, 8, 14, 20, ... 50 is used. If R1;, = “1,” 
odd parity is used. 


If Rlp = “0,” even parity for samples 
3, 9, 15, 21, ... 51 is used. If Rlp = “1,” 
odd parity is used. 


If RO; = “0,” even parity for samples 
4, 10, 16, 22, ... 52 is used. If ROz, = “1,” 
odd parity is used. 


If ROp = “0,” even parity for samples 
5, 11, 17, 23, ... 53 is used. If ROp = “1,” 
odd parity is used. 


The parity of samples 54-63 normally have 
even parity. However, they may be modified to 
transmit two additional bits of information: 


If CIBO = “0,” even parity for samples 
54, 55, 56, 57, and 58 is used. If CIBO = 
“1,” odd parity is used. 


If CIB1 = “0,” even parity for samples 
59, 60, 61, 62, and 63 is used. If CIB1 = 
“1,” odd parity is used. 


276 = Chapter 8: NTSC, PAL, and SECAM Overview 


The audio data is bit-interleaved as shown 
in Figure 8.22 to reduce the influence of drop- 
outs. If the bits are numbered 0-703, they are 
transmitted in the order 0, 44, 88, ... 660, 1, 45, 
89, ... 661, 2, 46, 90, ... 703. 

The whole frame, except the frame align- 
ment word, is exclusive-ORed with a 1-bit 
pseudo-random binary sequence (PRBS). The 
PRBS generator is reinitialized after the frame 
alignment word of each frame so that the first 
bit of the sequence processes the CO bit. The 
polynomial of the PRBS is x? + x* + 1 with an 
initialization word of “111111111.” 

Actual transmission consists of taking bits 
in pairs from the 728 kbps bitstream, then gen- 
erating 356k symbols per second using Differ- 
ential Quadrature Phase-Shift Keying 
(DQPSK). If the symbol is “00,” the subcarrier 
phase is left unchanged. If the symbol is “01,” 
the subcarrier phase is delayed 90°. If the sym- 
bol is “11,” the subcarrier phase is inverted. If 
the symbol is “10,” the subcarrier phase is 
advanced 90°. 

Finally, the signal is spectrum-shaped to a 
-30 dB bandwidth of ~700 kHz for (I) PAL or 
~500 kHz for (B, G) PAL. 


FRAME 
ALIGNMENT CONTROL 
WORD BITS 


ADDITIONAL DATA BITS 


Stereo Audio Decoding 

A PLL locks to the NICAM subcarrier fre- 
quency and recovers the phase changes that 
represent the encoded symbols. The symbols 
are decoded to generate the 728 kbps bit- 
stream. 

The frame alignment word is found and 
the following bits are exclusive-ORed with a 
locally-generated PRBS to recover the packet. 
The C0 bit is tested for 8 frames high, 8 frames 
low behavior to verify it is a NICAM 728 bit- 
stream. 

The bit-interleaving of the audio data is 
reversed, and the “signalling-in-parity” 
decoded: 


A majority vote is taken on the parity 
of samples 0, 6, 12, ... 48. If even, R2; = 
“0”; ifodd, R2, = “1.” 


A majority vote is taken on the parity 
of samples 1, 7, 13, ... 49. If even, R2p = 
“0”; ifodd, R2p = “1.” 


A majority vote is taken on the parity 
of samples 2, 8, 14, ... 50. If even, Rly, = 
“0”; if odd, R1y, = “1.” 


704 BITS 
AUDIO DATA 


0, 1, 0, 0, 1, 1, 1, 0, CO, C1, C2, C3, C4, ADO, AD1, AD2, AD3, AD4, AD5, AD6, AD7, AD8, AD9, AD10, 


A000, A044, A088, ... A660, 
A001, A045, A089, ... A661, 
A002, A046, A090, ... A662, 
A003, A047, A091, ... A663, 


A043, A087, A131, ... A703 


Figure 8.22. NICAM 728 Bitstream for One Frame. 
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A majority vote is taken on the parity 
of samples 3, 9, 15, ... 51. If even, Rlp = 
“0”; if odd, Rlp = “1.” 


A majority vote is taken on the parity 
of samples 4, 10, 16, ... 52. If even, RO; = 
“0”; if odd, RO; = “1.” 


A majority vote is taken on the parity 
of samples 5, 11, 17, ... 53. Ifeven, ROp = 
“0”; if odd, ROp = “1.” 


A majority vote is taken on the parity 
of samples 54, 55, 56, 57, and 58. If even, 
CIBO = “0”; if odd, CIBO = “1.” 


A majority vote is taken on the parity 
of samples 59, 60, 61, 62, and 63. If even, 
CIB1 = “0”; if odd, CIB1 = “1.” 


Any samples whose parity disagreed with 
the vote are ignored and replaced with an 
interpolated value. 

The left channel uses range bits R2;, R1,, 
and RO; to determine which bits below the 
sign bit were discarded during encoding. The 
sign bit is duplicated into those positions to 
generate a 14-bit sample. 

The right channel is similarly processed, 
using range bits R2p, Rlp, and ROp. Both chan- 
nels are then de-emphasized using the J.17 
curve. 


Dual Mono Audio Encoding 

Two blocks of thirty-two 14-bit samples (2 ms 
of audio, 2’s complement format) are pre- 
emphasized to the ITU-T J.17 specification. As 
with the stereo audio, three range bits per 
block (RO,, Rla, R2,, and RO, Rlp, R2p) are 
used to indicate the scaling factor. Unlike ste- 
reo audio, the samples are not interleaved. 


If R2, = “0,” even parity for samples 
0, 3, 6, 9, ... 24 is used. If R2 q = “1,” odd 
parity is used. 


If R2p = “0,” even parity for samples 
27, 30, 33, ... 51 is used. If R2p = “1,” odd 
parity is used. 


If Rl, = “0,” even parity for samples 
1, 4, 7, 10, ... 25 is used. If Rl, = “1,” odd 
parity is used. 


If Rlp = “0,” even parity for samples 
28, 31, 34, ... 52 is used. If Rlp = “1,” odd 
parity is used. 


If RO, = “0,” even parity for samples 
2, 5, 8, 11, ... 26 is used. If RO, = “1,” odd 
parity is used. 


If ROp = “0,” even parity for samples 
29, 32, 35, ... 53 is used. If ROp = “1,” odd 
parity is used. 


The audio data is bit-interleaved; however, 
odd packets contain 64 samples of audio chan- 
nel 1 while even packets contain 64 samples of 
audio channel 2. The rest of the processing is 
the same as for stereo audio. 


Analog Channel Assignments 


Tables 8.5 through 8.7 list the channel assign- 
ments for VHF, UHE and cable for various PAL 
systems. 

Note that cable systems routinely reassign 
channel numbers to alternate frequencies to 
minimize interference and provide multiple 
levels of programming (such as two versions of 
a premium movie channel: one for subscribers, 
and one for nonsubscribers during pre-view 
times). 
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Video Audio Channel Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 
(B) PAL, Australia, 7 MHz Channel (B) PAL, Italy, 7 MHz Channel 
0 46.25 51.75 45-52 A 53.75 59.25 52.5-99.5 
1 97.25 62.75 56-63 B 62.25 67.75 61-68 
2 64.25 69.75 63-70 C 82.25 87.75 81-88 
3 86.25 91.75 85-92 D 175.25 180.75 174-181 
4 95.25 100.75 94-101 E 183.75 189.25 182.5-189.5 
5 102.25 107.75 101-108 F 192.25 197.75 191-198 
5A 138.25 143.75 137-144 G 201.25 206.75 200-207 
6 175.25 180.75 174-181 H 210.25 215.75 209-216 
7 182.25 187.75 181-188 H-1 217.25 222.75 216-223 
8 189.25 194.75 188-195 H-2 224.25 229.75 223-230 
9 196.25 201.75 195-202 - - - - 
10 209.25 214.75 208-215 - - - - 
11 216.25 221.75 215-222 - - - - 
12 223.25 - - - - 
(I) PAL, Ireland, 8 MHz Channel (B) PAL, New Zealand, 7 MHz Channel 
1 45.75 51.75 44.5-52.5 1 45.25 50.75 44-51 
2 53.75 59.75 52.5-60.5 2 59.25 60.75 54-61 
3 61.75 67.75 60.5-68.5 3 62.25 67.75 61-68 
4 175.25 181.25 174-182 4 175.25 180.75 174-181 
5 183.25 189.25 182-190 5 182.25 187.75 181-188 
6 191.25 197.25 190-198 6 189.25 194.75 188-195 
7 199.25 205.25 198-206 7 196.25 201.75 195-202 
8 207.25 213.25 206-214 8 203.25 208.75 202-209 
9 215.25 221.25 214-222 9 210.25 215.75 209-216 


Table 8.5. Analog Broadcast and Cable TV Nominal Frequencies for (B, 1) PAL in Various Countries. 
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Audio 
3 Carrier 

Broadcast - Lay (MHz) “ anne! 

Channel arrier ange 

(MHz) (MHz) 

(G, H) PAL (1) PAL 

21 45.75 91.25 51.75 44,.5-52.5 
31 93.75 59.25 59.75 52.0-60.5 
41 61.75 67.25 67.75 60.5-68.5 
51 175.25 180.75 181.25 174-182 
6! 183.25 188.75 189.25 182-190 
7 191.25 196.75 197.25 190-198 
8i 199.25 204.75 205.25 198-206 
gi 207.25 212.75 213.25 206-214 
10! 215.25 220.75 221.25 214-222 

2? 48.25 53.75 - 47-54 

3? 59.29 60.75 - 94-61 

A 62.25 67.75 - 61-68 
52 175.25 180.75 - 174-181 
62 182.25 187.75 - 181-188 
72 189.25 194.75 - 188-195 
8? 196.25 201.75 - 195-202 
9? 203.25 208.75 - 202-209 
102 210.25 215.75 - 209-216 
112 217.25 222.75 - 216-223 
12? 224.25 229.75 - 223-230 
21 471.25 476.75 477.25 470-478 
22 479.25 484.75 485.25 478-486 
23 487.25 492.75 493.25 486-494 
24 495.25 500.75 901.25 494-502 
25 503.25 508.75 509.25 502-510 
26 911.25 916.75 917.25 510-518 
27 519.25 924.75 925.25 518-526 
28 927.25 932.75 533.25 526-534 
29 935.25 940.75 941.25 934-542 
30 943.25 948.75 949.25 542-550 
31 991.25 596.75 597.29 550-558 
32 599.25 564.75 565.25 5598-566 
33 967.25 972.75 973.25 566-574 
34 975.25 580.75 981.25 974-582 
35 583.25 588.75 589.25 582-590 
36 991.25 596.75 597.25 590-598 
37 599.25 604.75 605.25 598-606 
38 607.25 612.75 613.25 606-614 
39 615.25 620.75 621.25 614-622 


Table 8.6a. Analog Broadcast Nominal Frequencies for the 
1United Kingdom, +treland, tsouth Africa, tHong Kong, and 


2Western Europe. 
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Audio 
! Carrier 
Broadcast Video (MHz) Phannel 
Channel Carrier Range 
(MHz) (MHz) 
(G, H) PAL (1) PAL 
40 623.25 628.75 629.25 622-630 
41 631.25 636.75 637.25 630-638 
42 639.25 644.75 645.25 638-646 
43 647.25 652.75 653.25 646-654 
44 655.25 660.75 661.25 654-662 
45 663.25 668.75 669.25 662-670 
46 671.25 676.75 677.25 670-678 
47 679.25 684.75 685.25 678-686 
48 687.25 692.75 693.25 686-694 
49 695.25 700.75 701.25 694-702 
50 703.25 708.75 709.25 702-710 
51 711.25 716.75 717.25 710-718 
52 719.25 724.75 725.25 718-726 
53 727.25 732.75 733.25 726-734 
54 735.25 740.75 741.25 734-742 
55 743.25 748.75 749.25 742-750 
56 751.25 756.75 757.25 750-758 
57 759.25 764.75 765.25 758-766 
58 767.25 772.75 773.25 766-774 
59 775.25 780.75 781.25 774-782 
60 783.25 788.75 789.25 782-790 
61 791.25 796.75 797.25 790-798 
62 799.25 804.75 805.25 798-806 
63 807.25 812.75 813.25 806-814 
64 815.25 820.75 821.25 814-822 
65 823.25 828.75 829.25 822-830 
66 831.25 836.75 837.25 830-838 
67 839.25 844.75 845.25 838-846 
68 847.25 852.75 853.25 846-854 
69 855.25 860.75 861.25 854-862 


Table 8.6b. Analog Broadcast Nominal Frequencies for the 
United Kingdom, Ireland, South Africa, Hong Kong, and 
Western Europe. 
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Cable Video Audio Channel Cable Video Audio Channel 
Channel Carrier Carrier Range Channel Carrier Carrier Range 
(MHz) (MHz) (MHz) (MHz) (MHz) (MHz) 

E2 48.25 53.75 47-54 $11 231.25 236.75 230-237 
E3 59.25 60.75 54-61 $12 238.25 243.75 237-244 
E4 62.25 67.75 61-68 $13 245.25 250.75 244-251 
$01 69.25 74.75 68-75 $14 252.25 257.75 251-258 
S 02 76.25 81.75 75-82 $15 259.25 264.75 258-265 
S 03 83.25 88.75 82-89 $16 266.25 271.75 265-272 
$1 105.25 110.75 104-111 $17 273.25 278.75 272-279 
$2 112.25 117.75 111-118 $18 280.25 285.75 279-286 
$3 119.25 124.75 118-125 $19 287.25 292.75 286-293 
$4 126.25 131.75 125-132 S 20 294.25 299.75 293-300 
$5 133.25 138.75 132-139 $21 303.25 308.75 302-310 
S6 140.75 145.75 139-146 S 22 311.25 316.75 310-318 
S7 147.75 152.75 146-153 S 23 319.25 324.75 318-326 
$8 154.75 159.75 153-160 S$ 24 327.25 332.75 326-334 
S9 161.25 166.75 160-167 $25 335.25 340.75 334-342 
$10 168.25 173.75 167-174 S 26 343.25 348.75 342-350 
- - - - S27 351.25 356.75 350-358 

- - - - S 28 359.25 364.75 358-366 

- - - - S$ 29 367.25 372.75 366-374 
E5 175.25 180.75 174-181 S$ 30 375.25 380.75 374-382 
E6 182.25 187.75 181-188 $31 383.25 388.75 382-390 
E7 189.25 194.75 188-195 S 32 391.25 396.75 390-398 
E8 196.25 201.75 195-202 S 33 399.25 404.75 398-406 
E9 203.25 208.75 202-209 S 34 407.25 412.75 406-414 
E10 210.25 215.75 209-216 $35 415.25 420.75 414-422 
E11 217.25 222.75 216-223 S 36 423.25 428.75 422-430 
E12 224.25 229.75 223-230 S$ 37 431.25 436.75 430-438 
- - - - S 38 439.25 444.75 438-446 

- - - - S$ 39 447.25 452.75 446-454 

- - - - S 40 455.25 460.75 454-462 

- - - - S$ 41 463.25 468.75 462-470 


Table 8.7. Analog Cable TV Nominal Frequencies for the United Kingdom, Ireland, South Africa, 


Hong Kong, and Western Europe. 
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Use by Country 


Figure 8.19 shows the common designations 
for PAL systems. The letters refer to the mono- 
chrome standard for line and field rate, video 
bandwidth (4.2, 5.0, 5.5, or 6.0 MHz), audio 
carrier relative frequency, and RF channel 
bandwidth (6.0, 7.0, or 8.0 MHz). The “PAL” 
refers to the technique to add color informa 
tion to the monochrome signal. Detailed tim- 
ing parameters may be found in Table 8.9. 


The following countries use the (]) PAL 


standard. 
Angola Malawi 
Botswana Namibia 
China Nigeria 
Gambia South Africa 
Guinea-Bissau Tanzania 
Ireland United Kingdom 
Lesotho Zanzibar 
Macau 


The following countries use the (B) and 


(G) PAL standards. 


Albania Kenya 
Algeria Kuwait 
Austria Liberia 
Bahrain Libya 
Cambodia Lithuania 
Cameroon Luxembourg 
Croatia Malaysia 
Cyprus Netherlands 
Denmark New Zealand 
Egypt Nor way 
Equatorial Guinea Oman 
Ethiopia Pakistan 
Finland Papua New Guinea 
Germany Portugal 
Iceland Qatar 

Israel Sierra Leone 
Italy Singapore 
Jordan Slovenia 


Somalia Syria 
Spain Thailand 
Sri Lanka Turkey 
Sudan Yemen 
Sweden Yugoslavia 
Switzerland 


The following countries use the (N) PAL 
standard. Note that Argentina uses a modified 
PAL standard, called “Nc.” 


Paraguay 
Uruguay 


Argentina 
Aryenuna 


The following countries use the (M) PAL 
standard. 


Brazil 


The following countries use the (B) PAL 
standard. 


Australia Indonesia 
Bangladesh Maldives 

Belgium Malta 

Brunei Darussalam Nigeria 

Estonia Rwandese Republic 
Ghana Sao Tome & Principe 
India Seychelles 


The following countries use the (G) PAL 
standard. 


Hungary Zambia 
Mozambique Zimbabwe 
Romania 


The following countries use the (D) PAL 
standard. 


China Latvia 
Czech Republic Poland 
Hungary Romania 


Korea, North 
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The following countries use the (H) PAL 
standard. 


Belgium 


Luminance Equation Derivation 


The equation for generating luminance from 
RGB information is determined by the chroma- 
ticities of the three primary colors used by the 
receiver and what color white actually is. 

The chromaticities of the RGB primaries 
and reference white (CIE illuminate Dg;) are: 


R: x,= 0.64 y,=0.33 z,=0.03 
G: x,=0.29 y,g=0.60 z,=0.11 
B: x,=0.15 yp=0.06 2, =0.79 


white: X= 0.3127 yy = 0.3290 
Zy, = 0.3583 


where x and y are the specified CIE 1931 chro- 
maticity coordinates; z is calculated by know- 
ing thatx+y+z=1. 

As with NTSC, substituting the known val- 
ues gives us the solution for K,, Kg, and Kj: 


r -1 
K, 0.3127/0.3290] |0.64 0.29 0.15 


K,| = 1 0.33 0.60 0.06 
K,| _ |0.358370.3290] [0.03 0.11 0.79 
[0.674 
= 11.177 
[1.190 


Y is defined to be 


Y = (K,y,)R’ + Kgyg)G’ + (Kpyp) B’ 
= (0.674) (0.33)R’ + (1.177) (0.60) G’ 
+ (1.190) (0.06) B’ 


or 
Y = 0.222R’ + 0.706G’ + 0.071B’ 


However, the standard Y = 0.299R’ + 
0.587G’ + 0.114B’ equation is still used. Adjust- 
ments are made in the receiver to minimize 
color errors. 
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PALplus 


PALplus (ITU-R BT:1197 and ETSI ETS 300 
731) is the result of a cooperative project 
started in 1990, undertaken by several Euro- 
pean broadcasters. By 1995, they wanted to 
provide an enhanced definition television sys- 
tem (EDTV), compatible with existing receiv- 
ers. PALplus has been transmitted by a few 
broadcasters since 1994. 

A PALplus picture has a 16:9 aspect ratio. 
On conventional TVs, it is displayed as a 16:9 
letterboxed image with 430 active lines. On 
PALplus TVs, it is displayed as a 16:9 picture 
with 574 active lines, with extended vertical 
resolution. The full video bandwidth is avail- 
able for luminance detail. Cross color artifacts 
are reduced by clean encoding. 


Wide Screen Signalling 

Line 23 contains a Widescreen Signalling 
(WSS) control signal, defined by ITU-R 
BT:1119 and ETSI EN 300 294, used by PAL- 
plus TVs. This signal indicates: 


Program Aspect Ratio: 
Full Format 4:3 
Letterbox 14:9 Center 
Letterbox 14:9 Top 
Full Format 14:9 Center 
Letterbox 16:9 Center 
Letterbox 16:9 Top 
Full Format 16:9 Anamorphic 
Letterbox > 16:9 Center 


Enhanced services: 
Camera Mode 
Film Mode 


Subtitles: 
Teletext Subtitles Present 
Open Subtitles Present 


PALplus is defined as being Letterbox 16:9 
center, camera mode or film mode, helper sig- 
nals present using modulation, and clean 
encoding used. Teletext subtitles may or may 
not be present, and open subtitles may be 
present only in the active picture area. 

During a PALplus transmission, any active 
video on lines 23 and 623 is blanked prior to 
encoding. In addition to WSS data, line 23 
includes 48 +1 cycles of a 300 +9 mV subcar- 
rier with a -U phase, starting 51 us +250 ns 
after Oy. Line 623 contains a 10 us +250 ns 
white pulse, starting 20 us +250 ns after Oy. 

A PALplus TV has the option of deinterlac- 
ing a Film Mode signal and displaying it on a 
50-Hz progressive-scan display or using field 
repeating on a 100-Hz interlaced display. 


Ghost Cancellation 

An optional ghost cancellation signal on line 
318, defined by ITU-R BT.1124 and ETSI ETS 
300 732, allows a suitably adapted TV to mea- 
sure the ghost signal and cancel any ghosting 
during the active video. A PALplus TV may or 
may not support this feature. 


Vertical Filtering 
All PALplus sources start out as a 16:9 YCbCr 
anamorphic image, occupying all 576 active 
scan lines. Any active video on lines 23 and 623 
is blanked prior to encoding (since these lines 
are used for WSS and reference information), 
resulting in 574 active lines per frame. Lines 
24-310 and 336-622 are used for active video. 
Before transmission, the 574 active scan 
lines of the 16:9 image are squeezed into 430 
scan lines. To avoid aliasing problems, the ver- 
tical resolution is reduced by lowpass filtering. 
For Y, vertical filtering is done using a 
Quadrature Mirror Filter (QMF) highpass and 
lowpass pair. Using the QMF process allows 
the highpass and lowpass information to be 
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resampled, transmitted, and later recombined 
with minimal loss. 

The Y QMF lowpass output is resampled 
into three-quarters of the original height; little 
information is lost to aliasing. After clean 
encoding, it is the letterboxed signal that con- 
ventional 4:3 TVs display. 

The Y QMF highpass output contains the 
rest of the original vertical frequency. It is 
used to generate the helper signals that are 
transmitted using the “black” scan lines not 
used by the letterbox picture. 


Film Mode 

A film mode broadcast has both fields of a 
frame coming from the same image, as is usu- 
ally the case with a movie scanned on a tele- 
cine. 

In film mode, the maximum vertical reso- 
lution per frame is about 287 cycles per active 
picture height (cph), limited by the 574 active 
scan lines per frame. 

The vertical resolution of Y is reduced to 
215 cph so it can be transmitted using only 430 
active lines. The QMF lowpass and highpass 
filters split the Y vertical information into DC- 
215 cph and 216-287 cph. 

The Y lowpass information is re-scanned 
into 430 lines to become the letterbox image. 
Since the vertical frequency is limited to a 
maximum of 215 cph, no information is lost. 

The Y highpass output is decimated so 
only one in four lines are transmitted. These 
144 lines are used to transmit the helper sig- 
nals. Because of the QMF process, no informa- 
tion is lost to decimation. 

The 72 lines above and 72 lines below the 
central 430-line of the letterbox image are used 
to transmit the 144 lines of the helper signal. 
This results in a standard 574 active line pic- 
ture, but with the original image in its correct 
aspect ratio, centered between the helper sig- 
nals. The scan lines containing the 300 mV 


helper signals are modulated using the U sub- 
carrier so they look black and are not visible to 
the viewer. 

After Fixed ColorPlus processing, the 574 
scan lines are PAL encoded and transmitted as 
a standard interlaced PAL frame. 


Camera Mode 

Camera (or video) mode assumes the fields of 
a frame are independent of each other, as 
would be the case when a camera scans a 
scene in motion. Therefore, the image may 
have changed between fields. Only intra-field 
processing is done. 

In camera mode, the maximum vertical 
resolution per field is about 143 cycles per 
active picture height (cph), limited by the 287 
active scan lines per field. 

The vertical resolution of Y is reduced to 
107 cph so it can be transmitted using only 215 
active lines. The QMF lowpass and highpass 
filter pair split the Y vertical information into 
DC-107 cph and 108-143 cph. 

The Y lowpass information is re-scanned 
into 215 lines to become the letterbox image. 
Since the vertical frequency is limited to a 
maximum of 107 cph, no information is lost. 

The Y highpass output is decimated so 
only one in four lines is transmitted. These 72 
lines are used to transmit the helper signals. 
Because of the QMF process, no information is 
lost to decimation. 

The 36 lines above and 36 lines below the 
central 215-line of the letterbox image are used 
to transmit the 72 lines of the helper signal. 
This results in a 287 active line picture, but 
with the original image in its correct aspect 
ratio, centered between the helper signals. The 
scan lines containing the 300 mV helper sig- 
nals are modulated using the U subcarrier so 
they look black and are not visible to the 
viewer. 
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After either Fixed or Motion Adaptive Col- 
orPlus processing, the 287 scan lines are PAL 
encoded and transmitted as a standard PAL 
field. 


Clean Encoding 

Only the letterboxed portion of the PALplus 
signal is clean encoded. The helper signals are 
not actual PAL video. However, they are close 
enough to video to pass through the transmis- 
sion path and remain fairly invisible on stan- 
dard TVs. 


ColorPlus Processing 


Fixed ColorPlus 

Film Mode uses a Fixed ColorPlus technique, 
making use of the lack of motion between the 
two fields of the frame. 

Fixed ColorPlus depends on the subcar- 
rier phase of the composite PAL signal being of 
opposite phase when 312 lines apart. If these 
two lines have the same luminance and 
chrominance information, it can be separated 
by adding and subtracting the composite sig- 
nals from each other. Adding cancels the 
chrominance, leaving luminance. Subtracting 
cancels the luminance, leaving chrominance. 

In practice, Y information above 3 MHz 
(Yyp) is intra-frame averaged since it shares 
the frequency spectrum with the modulated 
chrominance. For line [n], Yyr is calculated as 
follows: 


0<n< 214 for 430-line letterboxed image 


YHF(60 +n) = 9-9(Y F372 +n) + YHFO6O +n) 
YHF(372 +n) = YHF(60 +n) 


Yup is then added to the low-frequency Y 
(Y,p) information. The same intra-frame aver- 
aging process is also used for Cb and Cr. The 
430-line letterbox image is then PAL encoded. 

Thus, Y information above 3 MHz, and 
CbCr information, is the same on lines [n] and 
[n+312]. Y information below 3 MHz may be 
different on lines [n] and [n+312]. The full ver- 
tical resolution of 287 cph is reconstructed by 
the decoder with the aid of the helper signals. 


Motion Adaptive ColorPlus (MACP) 

Camera Mode uses either Motion Adaptive 
ColorPlus or Fixed ColorPlus, depending on 
the amount of motion between fields. This 
requires a motion detector in both the encoder 
and decoder. 

To detect movement, the CbCr data on 
lines [n] and [n+312] are compared. If they 
match, no movement is assumed, and Fixed 
ColorPlus operation is used. If the CbCr data 
doesn’t match, movement is assumed, and 
Motion Adaptive ColorPlus operation is used. 

During Motion Adaptive ColorPlus opera- 
tion, the amount of Yyp added to Y;f is depen- 
dent on the difference between CbCr() and 
CbCr(q 4 312). For the maximum CbCr differ- 
ence, no Yyp data for lines [n] and [n+312] is 
transmitted. 

In addition, the amount of intra-frame aver- 
aged CbCr data mixed with the direct CbCr 
data is dependent on the difference between 
CbCri) and CbCr@ , 319). For the maximum 
CbCr difference, only direct CbCr data is trans- 
mitted separately for lines [n] and [n+312]. 
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SECAM Overview 


SECAM (Sequentiel Couleur Avec Mémoire or 
Sequential Color with Memory) was developed 
in France, with broadcasting starting in 1967, 
by realizing that, if color could be bandwidth- 
limited horizontally, why not also vertically? 
The two pieces of color information (Db and 
Dr) added to the monochrome signal could be 
transmitted on alternate lines, avoiding the 
possibility of crosstalk. 

The receiver requires memory to store 
one line so that it is concurrent with the next 
line, and also requires the addition of a line- 
switching identification technique. 

Like PAL, SECAM is a 625-line, 50-field- 
per-second, 2:1 interlaced system. SECAM was 
adopted by other countries; however, many are 
changing to PAL due to the abundance of pro- 
fessional and consumer PAL equipment. 


Luminance Information 


The monochrome luminance (Y) signal is 
derived from R’G’B:: 


Y = 0.299R’ + 0.587G’ + 0.114B’ 


As with NTSC and PAL, the luminance sig- 
nal occupies the entire video bandwidth. 
SECAM has several variations, depending on 
the video bandwidth and placement of the 
audio subcarrier. The video signal has a band- 
width of 5.0 or 6.0 MHz, depending on the spe- 
cific SECAM standard. 


Color Information 


SECAM transmits Db information during one 
line and Dr information during the next line; 
luminance information is transmitted each 


line. Db and Dr are scaled versions of B’ - Y 
and R’-Y: 


Dr =-1.902(R’- Y) 
Db = 1.505(B’ - Y) 


Since there is an odd number of lines, any 
given line contains Db information on one field 
and Dr information on the next field. The 
decoder requires a 1-H delay, switched syn- 
chronously with the Db and Dr switching, so 
that Db and Dr exist simultaneously in order to 
convert to YCbCr or RGB. 


Color Modulation 


SECAM uses FM modulation to transmit the 
Db and Dr color difference information, with 
each component having its own subcarrier. 

Db and Dr are lowpass filtered to 1.3 MHz 
and pre-emphasis is applied. The curve for the 
pre-emphasis is expressed by: 


ee +ifs) 
1+ ;) 


where f = signal frequency in kHz. 

After pre-emphasis, Db and Dr frequency 
modulate their respective subcarriers. The fre- 
quency of each subcarrier is defined as: 


Fop = 272 Fy = 4.250000 MHz (+ 2 kHz) 
For = 282 Fy = 4.406250 MHz (+ 2 kHz) 


These frequencies represent no color 
information. Nominal Dr deviation is +280 kHz 
and the nominal Db deviation is +230 kHz. Fig- 
ure 8.23 illustrates the frequency modulation 
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process of the color difference signals. The 
choice of frequency shifts reflects the idea of 
keeping the frequencies representing critical 
colors away from the upper limit of the spec- 
trum to minimize distortion. 

After modulation of Db and Dr, subcarrier 
pre-emphasis is applied, changing the ampli- 
tude of the subcarrier as a function of the fre- 
quency deviation. The intention is to reduce 
the visibility of the subcarriers in areas of low 
luminance and to improve the signal-to-noise 
ratio of highly saturated colors. This pre- 
emphasis is given as: 


1 +jl6F 
1+/1.26F 


where F = (f/4286) — (4286/f), f = instanta- 
neous subcarrier frequency in kHz, and 2M = 
23 + 2.5% of luminance amplitude. 


As shown in Table 8.8 and Figure 8.24, Db 
and Dr information is transmitted on alternate 
scan lines. The phase of the subcarriers is also 
reversed 180° on every third line and between 
each field to further reduce subcarrier visibil- 
ity. Note that subcarrier phase information in 
the SECAM system carries no picture informa- 
tion. 


Composite Video Generation 


The subcarrier data is added to the luminance 
along with appropriate horizontal and vertical 
sync signals, blanking signals, and burst sig- 
nals to generate composite video. 

As with PAL, SECAM requires some 
means of identifying the line-switching 
sequence. Modern practice has been to use a 
For/Fop burst after most horizontal syncs to 
derive the switching synchronization informa- 
tion, as shown in Figure 8.25. 


RED 
+DR 


ODD LINES ~— | 


YELLOW 
-DB 


EVEN LINES — | 


FOB 
4.25 


FOR 6 MHZ 
4.40 


Figure 8.23. SECAM FM Color Modulation. 
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Field aa Color Subcarrier Phase 
odd N Dr 0° 

even N+313 Db 180° 
odd N+1l Db 0° 

even N+314 Dr 0° 
odd N+2 Dr 180° 

even N+315 Db 180° 
odd N+3 Db 0° 

even N+ 316 Dr 180° 
odd N+4 Dr 0° 

even N+317 Db 0° 
odd N+5 Db 180° 

even N+ 318 Dr 180° 


Table 8.8. SECAM Color Versus Line and Field Timing. 


Use by Country The following countries use the (D) and 


SECAM standards. 
Figure 8.26 shows the common designations oe sereence 


for SECAM systems. The letters refer to the 


monochrome standard for line and field rates, inna rie 

video bandwidth (5.0 or 6.0 MHz), audio car- Bulgaria RiSein 

rier relative frequency, and RF channel band- Georgia Wheaties 

width. The SECAM refers to the technique to Ieagak hotan Wiee Nana 

add color information to the monochrome sig- Tata 

nal. Detailed timing parameters may be found 

in Table 8.9. The following countries use the (B) 


The following countries use the (B) and SECAM standard. 
(G) SECAM standards. 


Mauritania Djibouti 
Greece Mauritius 
Iran Morocco The following countries use the (D) 
Iraq Saudi Arabia SECAM standard. 
Lebanon Tunisia 


Mali Afghanistan Mongolia 
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START 
OF 
VSYNC 
DR DB DR DB ANALOG DR DB 
os FIELD 1 AS 
620 621 622 623 624 625 1 2 3 4 5 6 7 23 (|g 
DB DR DB ANALOG DB DR 
= FIELD 2 ~ 
| r ans 
308 309 | 310 3i1 312 313 314 315 316 317 318 319 320 336 337 
DB DR DB DR Uae tsar DB DR 
FIELD 3 
x . 
1 
620 | 621 622 | 623 624 625 1 2 3 4 5 6 7 23 (| old 
DR DB DR fears aaa DR DB 
IELD 4 
i : lO 


308 309 | 310 311 312 313 314 315 316 317 318 319 320 336 337 


Figure 8.24. Four-field SECAM Sequence. See Figure 8.5 for equalization and serration pulse 
details. 


BLANK LEVEL 


SYNC LEVEL 


HORIZONTAL 
BLANKING 


Figure 8.25. SECAM Chroma Synchronization Signals. 
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The following countries use the (K1) 


SECAM standard. 


Benin 

Burkina Faso 

Burundi 

Cape Verde 

Central African 
Republic 

Chad 

Comoros 

Congo 


FREQUENCY MODULATED SUBCARRIERS 
FOR = 282 FH 
FOB = 272 FH 
LINE SEQUENTIAL DR AND DB SIGNALS 


"D, K, K1, L" 


LINE/ FIELD = 625/ 50 
FH = 15.625 KHZ 
FV = 50 HZ 


BLANKING SETUP = 0 IRE 
VIDEO BANDWIDTH = 6.0 MHZ 
AUDIO CARRIER = 6.5 MHZ 
CHANNEL BANDWIDTH = 8 MHZ 


"BG" 


LINE/ FIELD = 625/ 50 
FH = 15.625 KHZ 
FV = 50 HZ 


BLANKING SETUP = 0 IRE 
VIDEO BANDWIDTH = 5.0 MHZ 
AUDIO CARRIER = 5.5 MHZ 
CHANNEL BANDWIDTH: 

B = 7 MHZ 

G = 8 MHZ 


Figure 8.26. Common SECAM Systems. 


Gabon 
Guadeloupe 
Madagascar 
Niger 
Senegal 
Tahiti 

Togo 

Zaire 


The following countries use the 


SECAM standard. 


France 


Monaco 


(L) 
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Luminance Equation Derivation 


The equation for generating luminance from 
RGB information is determined by the chroma- 
ticities of the three primary colors used by the 
receiver and what color white actually is. 

The chromaticities of the RGB primaries 
and reference white (CIE illuminate Dg) are: 


R: x,=0.64 y,=0.33 z,=0.03 
G: x,=0.29 yg=0.60 z,=0.11 
B: xy=0.15 yp=0.06 2 =0.79 


white: X_= 0.3127 yw = 0.3290 
Ing = 0.3583 


where x and y are the specified CIE 1931 chro- 
maticity coordinates; z is calculated by know- 
ing that x + y + z=1. Once again, substituting 
the known values gives us the solution for K,, 
Ky, and K,;: 


r -1 
K, 0.3127/0.3290] |0.64 0.29 0.15 
K,| = 1 0.33 0.60 0.06 
K, |0.3583/0.3290} |0.03 0.11 0.79 


[0.674 
= 11.177 
[1.190 


Y is defined to be 


Y = (K,y,)R’ + (Kgyg)G’ + (Kpyp) B’ 
= (0.674) (0.33)R’ + (1.177) (0.60) G’ 
+ (1.190) (0.06) B’ 


or 
Y = 0.222R’ + 0.706G’ + 0.071B’ 


However, the standard Y = 0.299R’ + 
0.587G’ + 0.114B’ equation is still used. Adjust- 
ments are made in the receiver to minimize 
color errors. 
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M N B,G H I D, K K1 L 
SCAN LINES PER FRAME 525 625 625 
FIELD FREQUENCY 59.94 50 50 
(FIELDS / SECOND) 
LINE FREQUENCY (HZ) 15,734 15,625 15,625 
PEAK WHITE LEVEL (IRE) 100 100 100 
SYNC TIP LEVEL (IRE) -40 -40 (-43) -43 
SETUP (IRE) 75225 |7.5+2.5 (0) 0 
PEAK VIDEO LEVEL (IRE) 120 133 133 115 115 125 
GAMMA OF RECEIVER 2.2 2.8 2.8 28 2.8 2.8 2.8 2.8 
VIDEO BANDWIDTH (MHZ) 4.2 5.0 (4.2) 5.0 5.0 5.5 6.0 6.0 6.0 
LUMINANCE SIGNAL Y = 0.299R’ + 0.685G’ + 0.114B’ (RGB ARE GAMMA-CORRECTED) 


1 Values in parentheses apply to (Nc) PAL used in Argentina. 


Table 8.9a. Basic Characteristics of Color Video Signals. 


294 


Chapter 8: NTSC, PAL, and SECAM Overview 


synchronizing pulses (10%, 90%) (ns) 


ioe B, D, G, H, I 
Characteristics M N K, K1, L, No 
Nominal line period (us) 63.5555 64 64 
Line blanking interval (us) 10.740.1 10.88 + 0.64 11.85+0.15 
Oy to start of active video (Us) 9.2+0.1 9.6 + 0.64 10.5 
Front porch (us) 1.5+0.1 1.92 + 0.64 1.65 + 0.15 
Line synchronizing pulse (us) 4.7+0.1 4.99 + 0.77 4.7+0.2 
Rise and fall time of line 
blanking (10%, 90%) (ns) 140 + 20 300 + 100 300 + 100 
Rise and fall time of line 
140 + 20 < 250 250 + 50 


Notes: 


1. Oy is at 50% point of falling edge of horizontal sync. 


2. In case of different standards having different specifications and tolerances, the 
tightest specification and tolerance is listed. 
3. Timing is measured between half-amplitude points on appropriate signal edges. 


Table 8.9b. Details of Line Synchronization Signals. 


SECAM Overview 295 
ee B, D, G, H, | 

Characteristics M N K, K1, L, No 
Field period (ms) 16.6833 20 20 
Field blanking interval 20 lines 19-25 lines 25 lines 
Rise and fall time of field blanking 
(10%, 90%) (ns) 140 + 20 < 250 300 + 100 
Duration of equalizing and 3H 3H 25H 
synchronizing sequences 
Equalizing pulse width (us) 2.3401 2.43 + 0.13 2.35+0.1 
Serration pulse width (us) 4.7+0.1 4.7+0.8 4.7+0.1 
Rise and fall time of synchronizing and 
equalizing pulses (10%, 90%) (ns) 140-20 S200 250 + 50 


Notes: 

1. In case of different standards having different specifications and tolerances, the tightest 
specification and tolerance is listed. 

2. Timing is measured between half-amplitude points on appropriate signal edges. 


Table 8.9c. Details of Field Synchronization Signals. 
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M/ NTSC 


M/PAL B, D, G, H, |, N/ PAL B, D, G, K, K1, K/ SECAM 


ATTENUATION OF COLOR 
DIFFERENCE SIGNALS 


U, V, 1, Q: 
<2DB AT 1.3MHZ 
>20DB AT 3.6 MHZ 


OR Q: 
<2DB AT 0.4 MHZ 
<6DB AT 0.5 MHZ 
>6DB AT 0.6 MHZ 


<2DB AT 1.3 MHZ 
>20DB AT 3.6 MHZ 


<3DB AT 1.3MHZ 
>20DB AT 4MHZ 
(> 20 DB AT 3.6 MHZ) 


<3DB AT 1.3MHZ 
>30DB AT 3.5 MHZ 


(BEFORE LOW FREQUENCY 
PRE-CORRECTION) 


START OF BURST 
AFTER OH (uS) 


5.3 + 0.07 


5.8 + 0.1 


BURST DURATION (CYCLES) 921 


10 + 1 (9 + 1) 


BURST PEAK AMPLITUDE 40 + 1 IRE 


42.86 + 4 IRE 


42.86 + 4 IRE 


Note: Values in parentheses apply to (Nc) PAL used in Argentina. 


Table 8.9d. Basic Characteristics of Color Video Signals. 


Video Test Signals 


Many industry-standard video test signals 
have been defined to help test the relative qual- 
ity of encoding, decoding, and the transmis- 
sion path, and to perform calibration. Note that 
some video test signals cannot be properly 
generated by providing RGB data to an 
encoder; in this case, YCbCr data may be used. 

If the video standard uses a 7.5-IRE setup, 
typically only test signals used for visual exam- 
ination use the 7.5-IRE setup. Test signals 
designed for measurement purposes typically 
use a 0-IRE setup, providing the advantage of 
defining a known blanking level. 


Color Bars Overview 


Color bars are one of the standard video test 
signals, and there are several variations, 
depending on the video standard and applica- 
tion. For this reason, this section reviews the 


most common color bar formats. Color bars 
have two major characteristics: amplitude and 
saturation. 

The amplitude of a color bar signalis 
determined by: 


max (R, G, B) 


= oe x 100 


max (R, G, B), 


amplitude (%) 


where max(R,G,B), is the maximum value of 
R'G’B’ during colored bars and max(R,G,B)p 
is the maximum value of R’G’B’ during refer- 
ence white. 

The saturation of a color bar signal is less 
than 100% if the minimum value of any one of 
the R’G’B’ components is not zero. The satura- 
tion is determined by: 


saturation (%) = 


min(R, G, 
ees G, B) | facie 
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where min(R,G,B) and max(R,G,B) are the 
minimum and maximum values, respectively, 
of R’G’B’ during colored bars, and y is the 
gamma exponent, typically [1/0.45]. 


NTSC Color Bars 

In 1953, it was normal practice for the analog 
R’G'B’ signals to have a 7.5 IRE setup, and the 
original NTSC equations assumed this form of 
input to an encoder. Today, digital R’G’B’ or 
YCbCr signals typically do not include the 7.5 
IRE setup, and the 7.5 IRE setup is added 
within the encoder. 

The different color bar signals are 
described by four amplitudes, expressed in 
percent, separated by oblique strokes. 100% 
saturation is implied, so saturation is not speci- 
fied. The first and second numbers are the 
white and black amplitudes, respectively. The 
third and fourth numbers are the white and 
black amplitudes from which the color bars are 
derived. 

For example, 100/7.5/75/7.5 color bars 
would be 75% color bars with 7.5% setup in 
which the white bar has been set to 100% and 
the black bar to 7.5%. Since NTSC systems usu- 
ally have the 7.5% setup, the two common color 
bars are 75/7.5/75/7.5 and 100/7.5/100/7.5, 
which are usually shortened to 75% and 100%, 
respectively. The 75% bars are most commonly 
used. Television transmitters do not pass infor- 
mation with an amplitude greater than about 
120 IRE. Therefore, the 75% color bars are 
used for transmission testing. The 100% color 
bars may be used for testing in situations 
where a direct connection between equipment 
is possible. The 75/7.5/75/7.5 color bars are a 
part of the Electronic Industries Association 
EIA-189-A Encoded Color Bar Standard. 

Figure 8.27 shows a typical vectorscope 
display for full-screen 75% NTSC color bars. 
Figure 8.28 illustrates the video waveform for 
75% color bars. 


Tables 8.10 and 8.11 list the luminance and 
chrominance levels for the two common color 
bar formats for NTSC. 

For reference, the RGB and YCbCr values 
to generate the standard NTSC color bars are 
shown in Tables 8.12 and 8.13. RGB is 
assumed to have a range of 0-255; YCbCr is 
assumed to have a range of 16-235 for Y and 
16-240 for Ch and Cr. It is assumed any 7.5 IRE 
setup is implemented within the encoder. 


PAL Color Bars 

Unlike NTSC, PAL does not support a 7.5 IRE 
setup; the black and blank levels are the same. 
The different color bar signals are usually 
described by four amplitudes, expressed in 
percent, separated by oblique strokes. The 
first and second numbers are the maximum 
and minimum percentages, respectively, of 
R’G’'B’ values for an uncolored bar. The third 
and fourth numbers are the maximum and 
minimum percentages, respectively, of R’G’B’ 
values for a colored bar. 

Since PAL systems have a 0% setup, the 
two common color bars are 100/0/75/0 and 
100/0/100/0, which are usually shortened to 
75% and 100%, respectively. The 75% color bars 
are used for transmission testing. The 100% 
color bars may be used for testing in situations 
where a direct connection between equipment 
is possible. 

The 100/0/75/0 color bars also are 
referred to as EBU (European Broadcast 
Union) color bars. All of the color bars dis- 
cussed in this section are also a part of Specifi- 
cation of Television Standards for 625-line 
System-I Transmissions (1971) published by the 
Independent Television Authority (ITA) and 
the British Broadcasting Corporation (BBC), 
and ITU-R BT.-471. 
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VM700A Video Measurement Set 
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Figure 8.27. Typical Vectorscope Display for 75% NTSC Color Bars. 
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Ww z 2 Na 
E =I Zz TH Ww uw 
+ 100 
I WHITE LEVEL (100 IRE) 
+89 
+77 
+72 
100 IRE +56 
+48 
+46 
3.58 MHZ +36 
COLOR BURST +38 +28 
(9 CYCLES) 
+12 +15 
20 IRE }— BLACK LEVEL (7.5 IRE) 
1 hrs IRE +7 
i BLANK LEVEL (0 IRE) 
20 IRE =9 
-16 -16 
40 IRE 
' SYNC LEVEL (-40 IRE) 
Figure 8.28. IRE Values for 75% NTSC Color Bars. 
i Minimum Maximum Fi 
: Chrominance i A Chrominance 
Luminance Chrominance Chrominance 
Level ; Phase 
(IRE) (IRE) Excursion Excursion (degrees) 
(IRE) (IRE) 8 
white 76.9 0 - - - 
yellow 69.0 62.1 37.9 100.0 167.1 
cyan 56.1 87.7 12.3 100.0 283.5 
green 48.2 81.9 7.3 89.2 240.7 
magenta 36.2 81.9 4.8 77.1 60.7 
red 28.2 87.7 -15.6 72.1 103.5 
blue 15.4 62.1 -15.6 46.4 347.1 
black 7.5 0 - - - 


Table 8.10. 75/7.5/75/7.5 (75%) NTSC Color Bars. 


300 = Chapter 8: NTSC, PAL, and SECAM Overview 


Fi Minimum Maximum F 
Luminance pee Chrominance Chrominance chromlnanes 
(IRE) ee) Excursion Excursion ( ees 
(IRE) (IRE) 
white 100.0 0 - - - 
yellow 89.5 82.8 48.1 130.8 167.1 
cyan 72.3 117.0 13.9 130.8 283.5 
green 61.8 109.2 7.2 116.4 240.7 
magenta 45.7 109.2 -8.9 100.3 60.7 
red 35.2 117.0 -23.3 93.6 103.5 
blue 18.0 82.8 -23.3 59.4 347.1 
black 7.5 0 - - - 
Table 8.11. 100/7.5/100/7.5 (100%) NTSC Color Bars. 
2 3 P 5 2 e ° x 
= 
gamma-corrected RGB (gamma = 1/0.45) 
R’ 191 191 0 0 191 191 0 
G’ 191 191 191 191 0 0 0 
B’ 191 0 191 0 191 0 191 
linear RGB 
R 135 135 0 0 135 135 0 
G 135 135 135 135 0 0 0 
B 135 0 135 0 135 0 135 
YCbCr 
Y 180 162 131 112 84 65 35 16 
Cb 128 44 156 72 184 100 212 128 
Cr 128 142 44 58 198 212 114 128 


Table 8.12. RGB and YCbCr Values for 75% NTSC Color Bars. 
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a 
= 
gamma-corrected RGB (gamma = 1/0.45) 
R’ 255 255 0 255 255 0 
G’ 255 255 255 0 0 0 0 
B’ 255 0 255 255 0 255 0 
linear RGB 
R 255 255 0 255 255 0 0 
G 255 255 255 0 0 0 0 
B 255 0 255 255 0 255 0 
YCbCr 

Y 235 210 170 106 81 41 16 
Cb 128 16 166 202 90 240 128 
Cr 128 146 16 222 240 110 128 


Table 8.13. RGB and YCbCr Values for 100% NTSC Color Bars. 


Figure 8.29 illustrates the video waveform 
for 75% color bars. Figure 8.30 shows a typical 
vectorscope display for full-screen 75% PAL 
color bars. 

Tables 8.14, 8.15, and 8.16 list the lumi- 
nance and chrominance levels for the three 
common color bar formats for PAL. 

For reference, the RGB and YCbCr values 
to generate the standard PAL color bars are 
shown in Tables 8.17, 8.18, and 8.19. RGB is 
assumed to have a range of 0-255; YCbCr is 
assumed to have a range of 16-235 for Y and 
16-240 for Cb and Cr. 


EIA Color Bars (NTSC) 


The EIA color bars (Figure 8.28 and Table 
8.10) are a part of the EIA-189-A standard. The 
seven bars (gray, yellow, cyan, green, magenta, 
red, and blue) are at 75% amplitude, 100% satu- 
ration. The duration of each color bar is 1/7 of 
the active portion of the scan line. Note that 
the black bar in Figure 8.28 and Table 8.10 is 
not part of the standard and is shown for refer- 
ence only. The color bar test signal allows 
checking for hue and color saturation accu- 
racy. 
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Chrominance 


Peak-to-Peak Chrominance Phase 
Luminance (denree?) 
(volts) 
U axis V axis Total Line n Linen +1 
(volts) (volts) (volts) (135° burst) (225° burst) 
white 0.700 0 - - - - 
yellow 0.465 0.459 0.105 0.470 167 193 
cyan 0.368 0.155 0.646 0.664 283.5 76.5 
green 0.308 0.304 0.541 0.620 240.5 119.5 
magenta 0.217 0.304 0.541 0.620 60.5 299.5 
red 0.157 0.155 0.646 0.664 103.5 256.5 
blue 0.060 0.459 0.105 0.470 347 13.0 
black 0 0 0 0 - - 
Table 8.14. 100/0/75/0 (75%) PAL Color Bars. 
Chrominance 
Peak-to-Peak Chrominance Phase 
Luminance (deereee) 
(volts) 
U axis V axis Total Line n Linen+1 
(volts) (volts) (volts) (135° burst) (225° burst) 
white 0.700 0 - - - - 
yellow 0.620 0.612 0.140 0.627 167 193 
cyan 0.491 0.206 0.861 0.885 283.5 76.5 
green 0.411 0.405 0.721 0.827 240.5 119.5 
magenta 0.289 0.405 0.721 0.827 60.5 299.5 
red 0.209 0.206 0.861 0.885 103.5 256.5 
blue 0.080 0.612 0.140 0.627 347 13.0 
black 0 0 0 0 - - 


Table 8.15. 100/0/100/0 (100%) PAL Color Bars. 
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Chrominance 
Peak-to-Peak Chrominance Phase 
Luminance (degrees) 
(volts) 
U axis V axis Total Line n Linen+1 
(volts) (volts) (volts) (135° burst) (225° burst) 
white 0.700 0 - - - - 
yellow 0.640 0.459 0.105 0.470 167 193 
cyan 0.543 0.155 0.646 0.664 283.5 76.5 
green 0.483 0.304 0.541 0.620 240.5 119.5 
magenta 0.392 0.304 0.541 0.620 60.5 299.5 
red 0.332 0.155 0.646 0.664 103.5 256.5 
blue 0.235 0.459 0.105 0.470 347 13.0 
black (0) 0 0 0 - - 
Table 8.16. 100/0/100/25 (98%) PAL Color Bars. 
x 
3 z & 
Ww io} a fi < 
+100 
r WHITE LEVEL (100 IRE) 
+ 88 
+75 
+66 +69 
100 IRE 
+53 
+44 +43 
4.43 MHZ 
COLOR BURST +31 
(10 CYCLES) +32 rE 
| 21.43 IRE sa 
{ aa ; BLACK / BLANK LEVEL (0 IRE) 
[> IRE 
=13 
43 IRE —25 -25 


Figure 


8.29. 


SYNC LEVEL (- 43 IRE) 


IRE Values for 75% PAL Color Bars. 
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Figure 8.30. Typical Vectorscope Display for 75% PAL Color Bars. 


Video Test Signals 


305 


a 
= 
gamma-corrected RGB (gamma = 1/0.45) 
R’ 255 191 0 0 191 191 0 
G’ 255 191 191 191 0 0 0 
B’ 255 0 191 0 191 0 191 
linear RGB 
R 255 135 0 0 135 135 0 
G 255 135 135 135 0 0 0 
B 255 0 135 0 135 0 135 
YCbCr 
Y 235 162 131 112 84 65 35 16 
Cb 128 44 156 72 184 100 212 128 
Cr 128 142 44 58 198 212 114 128 
Table 8.17. RGB and YCbCr Values for 75% PAL Color Bars. 
2 3 P 5 2 5 ° x 
= 
gamma-corrected RGB (gamma = 1/0.45) 
R’ 255 255 0 0 255 255 0 
G’ 255 255 255 255 0 0 0 
B’ 255 0 255 0 255 0 255 
linear RGB 
R 255 255 0 0 255 255 0 
G 255 255 255 255 0 0 0 
B 255 0 255 0 255 0 255 
YCbCr 
Y 235 210 170 145 106 81 41 16 
Cb 128 16 166 54 202 90 240 128 
Cr 128 146 16 34 222 240 110 128 


Table 8.18. RGB and YCbCr Values for 100% PAL Color Bars. 
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iy] 
® = < + x 
= 
gamma-corrected RGB (gamma = 1/0.45) 
R’ 255 255 44 44 255 255 44 44 
G’ 255 255 255 255 44 44 44 44 
B’ 255 44 255 44 255 44 255 44 
linear RGB 
R 255 255 5 5 255 255 5 5 
G 255 255 255 255 5 5 5 5 
B 255 5 255 5 255 5 255 5 
YCbCr 
Y 235 216 186 167 139 120 90 16 
Cb 128 44 156 72 184 100 212 128 
Cr 128 142 44 58 198 212 114 128 
Table 8.19. RGB and YCbCr Values for 98% PAL Color Bars. 


EBU Color Bars (PAL) 


The EBU color bars are similar to the EIA 
color bars, except a 100 IRE white level is used 
(see Figure 8.29 and Table 8.14). The six col- 
ored bars (yellow, cyan, green, magenta, red, 
and blue) are at 75% amplitude, 100% satura- 
tion, while the white bar is at 100% amplitude. 
The duration of each color bar is 1/7 of the 
active portion of the scan line. Note that the 
black bar in Figure 8.29 and Table 8.14 is not 
part of the standard and is shown for reference 
only. The color bar test signal allows checking 
for hue and color saturation accuracy. 


SMPTE Bars (NTSC) 


This split-field test signal is composed of the 
FIA color bars for the first 2/3 of the field, the 
Reverse Blue bars for the next 1/12 of the 


field, and the PLUGE test signal for the 
remainder of the field. 


Reverse Blue Bars 


The Reverse Blue bars are composed of the 
blue, magenta, and cyan colors bars from the 
EJA/EBU color bars, but are arranged in a dif- 
ferent order—blue, black, magenta, black, 
cyan, black, and white. The duration of each 
color bar is 1/7 of the active portion of the scan 
line. Typically, Reverse Blue bars are used with 
the EIA or EBU color bar signal in a split-field 
arrangement, with the EJA/EBU color bars 
comprising the first 3/4 of the field and the 
Reverse Blue bars comprising the remainder 
of the field. This split-field arrangement eases 
adjustment of chrominance and hue on a color 
monitor. 
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PLUGE 


PLUGE (Picture Line-Up Generating Equip- 
ment) is a visual black reference, with one area 
blacker-than-black, one area at black, and one 
area lighter-than-black. The brightness of the 
monitor is adjusted so that the black and 
blacker-than-black areas are indistinguishable 
from each other and the lighter-than-black area 
is slightly lighter (the contrast should be at the 
normal setting). Additional test signals, such 
as a white pulse and modulated IQ signals are 
usually added to facilitate testing and monitor 
alignment. 


WHITE 


+ 
° 
ro} 


3.58 MHZ 
COLOR BURST - 
(9 CYCLES) 


SS 


+20 


+Q 


+27.5 + 27.5 


-—12.5 


MICROSECONDS = 9.7 19.1 285 38 


47 49.6 


The NTSC PLUGE test signal (shown in 
Figure 8.31) is composed of a 7.5 IRE (black 
level) pedestal with a 40 IRE “-I” phase modu- 
lation, a 100 IRE white pulse, a 40 IRE “+Q” 
phase modulation, and 3.5 IRE, 7.5 IRE, and 
11.5 IRE pedestals. Typically, PLUGE is used 
as part of the SMPTE bars. 

For PAL, each country has its own slightly 
different PLUGE configuration, with most dif- 
ferences being the black pedestal level used, 
and work is being done on a standard test sig- 
nal. Figure 8.32 illustrates a typical PAL 
PLUGE test signal. Usually used as a full 
screen test signal, it is composed of a 0 IRE 


+ 11.5 


BLACK LEVEL (7.5 IRE) 
BLANK LEVEL (0 IRE) 


SYNC LEVEL (- 40 IRE) 


52 54.5 


Figure 8.31. PLUGE Test Signal for NTSC. IRE values are indicated. 
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pedestal with PLUGE (2 IRE, 0 IRE, and 2 
IRE pedestals) and a white pulse. The white 
pulse may have five levels of brightness (0, 25, 
50, 75, and 100 IRE), depending on the scan 
line number, as shown in Figure 8.32. The 
PLUGE is displayed on scan lines that have 
non-zero IRE white pulses. ITU-R BT:1221 dis- 
cusses considerations for various PAL sys- 
tems. 


4.43 MHZ 
COLOR BURST 
(10 CYCLES) 


+2 


-2 


— 21.43 


MICROSECONDS = 22.5 24.8 27.1 29.4 


Y Bars 


The Y bars consist of the luminance-only levels 
of the EIA/EBU color bars; however, the black 
level (7.5 IRE for NTSC and 0 IRE for PAL) is 
included and the color burst is still present. 
The duration of each luminance bar is there- 
fore 1/8 of the active portion of the scan line. Y 
bars are useful for color monitor adjustment 
and measuring luminance nonlinearity. Typi- 
cally, the Y bars signal is used with the EIA or 
EBU color bar signal in a split-field arrange- 
ment, with the EIA/EBU color bars compris- 
ing the first 3/4 of the field and the Y bars 
signal comprising the remainder of the field. 


WHITE LEVEL (IRE) 
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+100 
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Figure 8.32. PLUGE Test Signal for PAL. IRE values are indicated. 
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Red Field 


The Red Field signal consists of a 75% ampli- 
tude, 100% saturation red chrominance signal. 
This is useful as the human eye is sensitive to 
static noise intermixed in a red field. Distor- 
tions that cause small errors in picture quality 
can be examined visually for the effect on the 
picture. Typically, the Red Field signal is used 
with the EIA/EBU color bars signal in a split- 
field arrangement, with the EJA/EBU color 
bars comprising the first 3/4 of the field, and 
the Red Field signal comprising the remainder 
of the field. 


10-Step Staircase 


This test signal is composed of ten unmodu- 
lated luminance steps of 10 IRE each, ranging 
from 0 IRE to 100 IRE, shown in Figure 8.33. 
This test signal may be used to measure lumi- 
nance nonlinearity. 


IRE 
LEVELS 
™ 50 


COLOR BURST 


\ 


MICROSECONDS = 


60 


17.5 21.5 25.5 29.5 33.5 37.5 41.5 45.5 49.5 53.5 


Modulated Ramp 


The modulated ramp test signal, shown in Fig- 
ure 8.34, is composed of a luminance ramp 
from 0 IRE to either 80 or 100 IRE, superim- 
posed with modulated chrominance that has a 
phase of 0° +1° relative to the burst. The 80 
IRE ramp provides testing of the normal oper- 
ating range of the system; a 100 IRE ramp may 
be used to optionally test the entire operating 
range. The peak-to-peak modulated chromi- 
nance is 40 +0.5 IRE for (M) NTSC and 42.86 
+0.5 IRE for (B, D, G, H, I) PAL. Note a 0 IRE 
setup is used. The rise and fall times at the 
start and end of the modulated ramp envelope 
are 400 +25 ns (NTSC systems) or approxi- 
mately 1 us (PAL systems). This test signal 
may be used to measure differential gain. The 
modulated ramp signal is preferred over a 5- 
step or 10-step modulated staircase signal 
when testing digital systems. 


100 


WHITE LEVEL (100 IRE) 


BLANK LEVEL (0 IRE) 


SYNC LEVEL 


61.8 


Figure 8.33. Ten-Step Staircase Test Signal for NTSC and PAL. 
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Modulated Staircase 


The 5-step modulated staircase signal (a 10- 
step version is also used), shown in Figure 
8.35, consists of 5 luminance steps, superim- 
posed with modulated chrominance that has a 
phase of 0° +1° relative to the burst. The peak- 
to-peak modulated chrominance amplitude is 
40 +0.5 IRE for (M) NTSC and 42.86 +0.5 IRE 
for (B, D, G, H, I) PAL. Note a 0 IRE setup is 
used. The rise and fall times of each modula 
tion packet envelope are 400 +25 ns (NTSC 
systems) or approximately 1 us (PAL sys 
tems). The luminance IRE levels for the 5-step 
modulated staircase signal are shown in Fig- 
ure 8.35. This test signal may be used to mea- 
sure differential gain. The modulated ramp 
signal is preferred over a 5-step or 10-step 
modulated staircase signal when testing digital 
systems. 


COLOR BURST 


XS 


MICROSECONDS = 14.9 20.2 


Modulated Pedestal 


The modulated pedestal test signal (also called 
a three-level chrominance bar), shown in Fig- 
ure 8.36, is composed of a 50 IRE luminance 
pedestal, superimposed with three amplitudes 
of modulated chrominance that has a phase 
relative to the burst of —90° +1°. The peak-to- 
peak amplitudes of the modulated chromi- 
nance are 20 +0.5, 40 +0.5, and 80 +0.5 IRE for 
(M) NTSC and 20 +0.5, 60 +0.5, and 100 +0.5 
IRE for (B, D, G, H, I) PAL. Note a 0 IRE setup 
is used. The rise and fall times of each modula- 
tion packet envelope are 400 +25 ns (NTSC 
systems) or approximately 1 us (PAL sys- 
tems). This test signal may be used to measure 
chrominance-to-luminance — intermodulation 
and chrominance nonlinear gain. 
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Figure 8.34. 80 IRE Modulated Ramp Test Signal for NTSC and PAL. 
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Figure 8.35. Five-Step Modulated Staircase Test Signal for NTSC and PAL. 
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Figure 8.36. Modulated Pedestal Test Signal for NTSC and PAL. 
PAL IRE values are shown in parentheses. 
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Multiburst 


The multiburst test signal for (M) NTSC, 
shown in Figure 8.37, consists of a white flag 
with a peak amplitude of 100 +1 IRE and six 
frequency packets, each a specific frequency. 
The packets have a 40 +1 IRE pedestal with 
peak-to-peak amplitudes of 60 +0.5 IRE. Note a 
0 IRE setup is used and the starting and end- 
ing point of each packet is at zero phase. 

The ITU multiburst test signal for (B, D, G, 
H, I) PAL, shown in Figure 8.38, consists of a 4 
us white flag with a peak amplitude of 80 +1 
IRE and six frequency packets, each a specific 
frequency. The packets have a 50 +1 IRE ped- 
estal with peak-to-peak amplitudes of 60 +0.5 
IRE. Note the starting and ending points of 
each packet are at zero phase. The gaps 
between packets are 0.4—2.0 us. The ITU multi- 
burst test signal may be present on line 18. 

The multiburst signals are used to test the 
frequency response of the system by measur- 
ing the peak-to-peak amplitudes of the packets. 


100 IRE 


Line Bar 


The line bar is a single 100 +0.5 IRE (reference 
white) pulse of 10 us (PAL), 18 us (NTSC), or 
25 us (PAL) that occurs anywhere within the 
active scan line time (rise and fall times are < 1 
us). Note the color burst is not present, and a 0 
IRE setup is used. This test signal is used to 
measure line time distortion (line tilt or H tilt). 
A digital encoder or decoder does not generate 
line time distortion; the distortion is generated 
primarily by the analog filters and transmission 
channel. 


Multipulse 


The (M) NTSC multipulse contains a 2T pulse 
and 25T and 12.5T pulses with various high-fre- 
quency components, as shown in Figure 8.39. 
The (B, D, G, H, I) PAL multipulse is similar, 
except 20T and 10T pulses are used, and there 
is no 7.5 IRE setup. This test signal is typically 
used to measure the frequency response of the 
transmission channel. 
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Figure 8.37. Multiburst Test Signal for NTSC. 
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Figure 8.38. ITU Multiburst Test Signal for PAL. 
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BLANK LEVEL (0 IRE) 
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Figure 8.39. Multipulse Test Signal for NTSC and PAL. PAL values are shown in parentheses. 
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Field Square Wave 


The field square wave contains 100 +0.5 IRE 
pulses for the entire active line time for Field 1 
and blanked scan lines for Field 2. Note the 
color burst is not present and a 0 IRE setup is 
used. This test signal is used to measure field 
time distortion (field tilt or V tilt). A digital 
encoder or decoder does not generate field 
time distortion; the distortion is generated pri- 
marily by the analog filters and transmission 
channel. 


2T 
100 IRE 


3.58 MHZ 
COLOR BURST 
(9 CYCLES) 


NG 


+20 


Composite Test Signal 


NTC-7 Version for NTSC 

The NTC (U. S. Network Transmission Com- 
mittee) has developed a composite test signal 
that may be used to test several video parame- 
ters, rather than using multiple test signals. 
The NTC-7 composite test signal for NTSC 
systems (shown in Figure 8.40) consists of a 
100 IRE line bar, a 2T pulse, a 12.5T chromi- 
nance pulse, and a 5-step modulated staircase 
signal. 


12.5T 


BLANK LEVEL (0 IRE) 


MICROSECONDS = 12 


SYNC LEVEL (- 40 IRE) 


37 42 46 49 52 55 58 61 


Figure 8.40. NTC-7 Composite Test Signal for NTSC, With Corresponding IRE Values. 
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The line bar has a peak amplitude of 100 
+0.5 IRE, and 10-90% rise and fall times of 125 
+5 ns with an integrated sine-squared shape. It 
has a width at the 60 IRE level of 18 us. 

The 2T pulse has a peak amplitude of 100 
+0.5 IRE, with a half-amplitude width of 250 
+10 ns. 

The 12.5T chrominance pulse has a peak 
amplitude of 100 +0.5 IRE, with a half-ampli- 
tude width of 1562.5 +50 ns. 

The 5-step modulated staircase signal con- 
sists of 5 luminance steps superimposed with a 
40 +0.5 IRE subcarrier that has a phase of 0° 
+1° relative to the burst. The rise and fall times 
of each modulation packet envelope are 400 
+25 ns. 

The NTC-7 composite test signal may be 
present on line 17. 


100 IRE 


4.43 MHZ 
COLOR BURST 
(10 CYCLES) 


\ 


ITU Version for PAL 

The ITU (BT.628 and BT:473) has developed a 
composite test signal that may be used to test 
several video parameters, rather than using 
multiple test signals. The ITU composite test 
signal for PAL systems (shown in Figure 8.41) 
consists of a white flag, a 2T pulse, and a 5-step 
modulated staircase signal. 

The white flag has a peak amplitude of 100 
+1 IRE and a width of 10 us. 

The 2T pulse has a peak amplitude of 100 
+0.5 IRE, with a halfamplitude width of 200 
+10 ns. 

The 5-step modulated staircase signal con- 
sists of 5 luminance steps (whose IRE values 
are shown in Figure 8.41) superimposed with a 
42.86 +0.5 IRE subcarrier that has a phase of 
60° +1° relative to the U axis. The rise and fall 
times of each modulation packet envelope are 
approximately 1 us. 


60 


40 


0 BLANK LEVEL (0 IRE) 


MICROSECONDS = 12 22 26 


SYNC LEVEL (- 43 IRE) 


30 40 44 48 52 56 60 


Figure 8.41. ITU Composite Test Signal for PAL, With Corresponding IRE Values. 
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The ITU composite test signal may be 
present on line 330. 


U.K. Version 

The United Kingdom allows the use of a 
slightly different test signal since the 10T 
pulse is more sensitive to delay errors than the 
20T pulse (at the expense of occupying less 
chrominance bandwidth). Selection of an 
appropriate pulse width is a trade-off between 
occupying the PAL chrominance bandwidth as 
fully as possible and obtaining a pulse with suf- 
ficient sensitivity to delay errors. Thus, the 
national test signal (developed by the British 
Broadcasting Corporation and the Indepen- 
dent Television Authority) in Figure 8.42 may 
be present on lines 19 and 332 for (I) PAL sys- 
tems in the United Kingdom. 


100 IRE 


4.43 MHZ 
COLOR BURST 
(10 CYCLES) 


* 


The white flag has a peak amplitude of 100 
+1 IRE and a width of 10 us. 

The 2T pulse has a peak amplitude of 100 
+0.5 IRE, with a halfamplitude width of 200 
+10 ns. 

The 10T chrominance pulse has a peak 
amplitude of 100 +0.5 IRE. 

The 5-step modulated staircase signal con- 
sists of 5 luminance steps (whose IRE values 
are shown in Figure 8.42) superimposed with a 
21.43 +0.5 IRE subcarrier that has a phase of 
60° +1° relative to the U axis. The rise and fall 
times of each modulation packet envelope is 
approximately 1 us. 


10T 
100 


80 


60 


40 


20 


Loto BLANK LEVEL (0 IRE) 


MICROSECONDS = 12 


| SYNC LEVEL (- 43 IRE) 


30 34 40 44 48 52 56 60 


Figure 8.42. United Kingdom (1) PAL National Test Signal #1, 


With Corresponding IRE Values. 
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Combination Test Signal 


NTC-7 Version for NTSC 

The NTC (U. S. Network Transmission Com- 
mittee) has also developed a combination test 
signal that may be used to test several video 
parameters, rather than using multiple test sig- 
nals. The NTC-7 combination test signal for 
NTSC systems (shown in Figure 8.43) consists 
of a white flag, a multiburst, and a modulated 
pedestal signal. 

The white flag has a peak amplitude of 100 
+1 IRE and a width of 4 us. 

The multiburst has a 50 +1 IRE pedestal 
with peak-to-peak amplitudes of 50 +0.5 IRE. 
The starting point of each frequency packet is 
at zero phase. The width of the 0.5 MHz packet 
is 5 us; the width of the remaining packets is 3 


Us. 


100 IRE 


MHZ 


05 1 2 3 3.58 4.2 


The 3-step modulated pedestal is com- 
posed of a 50 IRE luminance pedestal, superim- 
posed with three amplitudes of modulated 
chrominance (20 +0.5, 40 +0.5, and 80 +0.5 IRE 
peak-to-peak) that have a phase of -90° +1° rel- 
ative to the burst. The rise and fall times of 
each modulation packet envelope are 400 +25 
ns. 

The NTC-7 combination test signal may be 
present on line 280. 


ITU Version for PAL 

The ITU (BT.473) has developed a combina- 
tion test signal that may be used to test several 
video parameters, rather than using multiple 
test signals. The ITU combination test signal 
for PAL systems (shown in Figure 8.44) con- 
sists of a white flag, a 2T pulse, a 20T modu- 
lated chrominance pulse, and a_ 5-step 
luminance staircase signal. 


COLOR BURST 


x 


50 IRE 


BLANK LEVEL (0 IRE) 


MICROSECONDS = 12 18 


SYNC LEVEL 


24 28 32 36 40 46 50 54 60 


Figure 8.43. NTC-7 Combination Test Signal for NTSC. 
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The line bar has a peak amplitude of 100 +1 
IRE and a width of 10 us. 

The 2T pulse has a peak amplitude of 100 
+0.5 IRE, with a half-amplitude width of 200 
+10 ns. 

The 20T chrominance pulse has a peak 
amplitude of 100 +0.5 IRE, with a half-ampli- 
tude width of 2.0 +0.06 us. 

The 5-step luminance staircase signal con- 
sists of 5 luminance steps, at 20, 40, 60, 80 and 
100 +0.5 IRE. 

The ITU combination test signal may be 
present on line 17. 


ITU ITS Version for PAL 

The ITU (BT.473) has developed a combina- 
tion ITS (insertion test signal) that may be 
used to test several PAL video parameters, 
rather than using multiple test signals. The 
ITU combination ITS for PAL systems (shown 
in Figure 8.45) consists of a 3-step modulated 
pedestal with peak-to-peak amplitudes of 20, 


2T 
100 IRE 


4.43 MHZ 
COLOR BURST 
(10 CYCLES) 


aN 


60, and 100 +1 IRE, and an extended subcar- 
rier packet with a peak-to-peak amplitude of 60 
+1 IRE. The rise and fall times of each subcar- 
rier packet envelope are approximately 1 us. 
The phase of each subcarrier packet is 60° 
+1° relative to the U axis. The tolerance on the 
50 IRE level is +1 IRE. 

The ITU composite ITS may be present on 
line 331. 


U.K. Version 
The United Kingdom allows the use of a 
slightly different test signal, as shown in Fig- 
ure 8.46. It may be present on lines 20 and 333 
for () PAL systems in the United Kingdom. 
The test signal consists of a 50 IRE lumi- 
nance bar, part of which has a 100 IRE subcar- 
rier superimposed that has a phase of 60° 
+1° relative to the U axis, and an extended 
burst of subcarrier on the second half of the 
scan line. 


20T 
100 IRE 


BLANK LEVEL (0 IRE) 


MICROSECONDS = 12 22 26 


SYNC LEVEL (- 43 IRE) 


32 40 44 48 52 56 62 


Figure 8.44. ITU Combination Test Signal for PAL. 
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100 


IRE 
80 
80 IRE 
60 
4.43 MHZ —| seis 
COLOR BURST 
(10 CYCLES) 
\ 40 
20 IRE 
20 
BLANK LEVEL (0 IRE) 
SYNC LEVEL (- 43 IRE) 
MICROSECONDS = 12 14 18 22. 28 34 60 


Figure 8.45. ITU Combination ITS Test Signal for PAL. 
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Figure 8.46. United Kingdom (1) PAL National Test Signal #2. 
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T Pulse 


Square waves with fast rise times cannot be 
used for testing video systems, since attenua- 
tion and phase shift of out-of-band components 
cause ringing in the output signal, obscuring 
the in-band distortions being measured. T, or 
sin’, pulses are bandwidth-limited, so are used 
for testing video systems. 

The 2T pulse is shown in Figure 8.47 and, 
like the T pulse, is obtained mathematically by 
squaring a half-cycle of a sine wave. T pulses 
are specified in terms of half amplitude dura- 
tion (HAD), which is the pulse width measured 
at 50% of the pulse amplitude. Pulses with 
HADs that are multiples of the time interval T 


250 NS 


> TIME (NS) 
-250 0 250 


are used to test video systems. As seen in Fig- 
ures 8.39 through 8.44, T, 2T, 12.5T and 25T 
pulses are common when testing NTSC video 
systems, whereas T, 2T, 10T; and 20T pulses 
are common for PAL video systems. 

T is the Nyquist interval or 


1/2F¢ 


where F¢ is the cutoff frequency of the video 
system. For NTSC, Fc is 4 MHz, whereas Fc 
for PAL systems is 5 MHz. Therefore, T for 
NTSC systems is 125 ns and for PAL systems it 
is 100 ns. For a T pulse with a HAD of 125 ns, a 
2T pulse has a HAD of 250 ns, and so on. The 
frequency spectra for the 2T pulse is shown in 


> TIME (NS) 


(C) 


= FREQUENCY (MHZ) 


Figure 8.47. The T Pulse. (a) 2T pulse. (b) 2T step. (c) Frequency spectra of the 2T pulse. 
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Figure 8.47 and is representative of the energy 
content in a typical character generator wave- 
form. 

To generate smooth rising and falling 
edges of most video signals, a T step (gener- 
ated by integrating a T pulse) is typically used. 
T steps have 10-90% rise/fall times of 0.964T 
and a well-defined bandwidth. The 2T step gen- 
erated from a 2T pulse is shown in Figure 8.47. 

The 12.5T chrominance pulse, illustrated 
in Figure 8.48, is a good test signal to measure 
any chrominance-to-luminance timing error 
since its energy spectral distribution is 
bunched in two relatively narrow bands. Using 
this signal detects differences in the luminance 
and chrominance phase distortion, but not 
between other frequency groups. 


1562.5 NS 


> TIME (NS) 
-1562.5 0 1562.5 


> TIME (NS) 


-0.5 


(B) 


VBI Data 


VBI (vertical blanking interval) data may be 
inserted up to about 5 scan lines into the active 
picture region to ensure it won't be deleted by 
equipment replacing the VBI, by DSS MPEG 
which deletes the VBI, or by cable systems 
inserting their own VBI data. This is common 
practice by Neilson and others to ensure their 
programming and commercial tracking data 
gets through the distribution systems to the 
receivers. In most cases, this will be unseen 
since it is masked by the TV’s overscan. 


Timecode 


Two types of time coding are commonly used, 
as defined by ANSI/SMPTE 12M and IEC 461: 


1562.5 NS 


> TIME (NS) 
-1562.5 0 1562.5 


(C) 


Figure 8.48. The 12.5T Chrominance Pulse. (a) Luma component. 
(b) Chroma component. (c) Addition of (a) and (b). 
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longitudinal timecode (LTC) and vertical inter- 
val timecode (VITC). 

The LTC is recorded on a separate audio 
track; as a result, the analog VCR must use 
high-bandwidth amplifiers and audio heads. 
This is due to the time code frequency increas- 
ing as tape speed increases, until the point that 
the frequency response of the system results 
in a distorted time code signal that may not be 
read reliably. At slower tape speeds, the time 
code frequency decreases, until at very low 
tape speeds or still pictures, the time code 
information is no longer recoverable. 

The VITC is recorded as part of the video 
signal; as a result, the time code information is 
always available, regardless of the tape speed. 
However, the LTC allows the time code signal 
to be written without writing a video signal; the 
VITC requires the video signal to be changed if 
a change in time code information is required. 
The LTC therefore is useful for synchronizing 
multiple audio or audio/video sources. 


Frame Dropping 

If the field rate is 60/1.001 fields per second, 
straight counting at 60 fields per second yields 
an error of about 108 frames for each hour of 
running time. This may be handled in one of 
three ways: 


Nondrop frame: During a continuous 
recording, each time count increases 
by 1 frame. In this mode, the drop 
frame flag will be a “0.” 


Drop frame: To minimize the timing 
error, the first two frame numbers (00 
and 01) at the start of each minute, 
except for minutes 00, 10, 20, 30, 40, 
and 50, are omitted from the count. In 
this mode, the drop frame flag will be a 
a Deca 


Drop frame for (M) PAL: To minimize 
the timing error, the first four frame 
numbers (00 to 03) at the start of 
every second minute (even minute 
numbers) are omitted from the count, 
except for minutes 00, 20, and 40. In 
this mode, the drop frame flag will be a 
Sie? 


Even with drop framing, there is a long- 
term error of about 2.26 frames per 24 hours. 
This error accumulation is the reason time- 
code generators must be periodically reset if 
they are to maintain any correlation to the cor- 
rect time-of-day. Typically, this “reset-to-real- 
time” is referred to as a “jam sync” procedure. 
Some jam sync implementations reset the 
timecode to 00:00:00.00 and, therefore, must 
occur at midnight; others allow a true re-sync 
to the correct time-of-day. 

One inherent problem with jam sync cor- 
rection is the interruption of the timecode. 
Although this discontinuity may be brief, it 
may cause timecode readers to “hiccup” due to 
the interruption. 


Longitudinal Timecode (LTC) 

The LTC information is transferred using a 
separate serial interface, using the same elec- 
trical interface as the AES/EBU digital audio 
interface standard, and is recorded on a sepa- 
rate track. The basic structure of the time data 
is based on the BCD system. Tables 8.20 and 
8.21 list the LTC bit assignments and arrange- 
ment. Note the 24-hour clock system is used. 


LTC Timing 

The modulation technique is such that a transi- 
tion occurs at the beginning of every bit 
period. “1” is represented by a second transi- 
tion one-half a bit period from the start of the 
bit. “O” is represented when there is no transi- 
tion within the bit period (see Figure 8.49). 
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Bit(s) Function Note Bit(s) Function Note 
0-3 units of frames 58 flag 5 note 5 
4-7 user group 1 59 flag 6 note 6 
8-9 tens of frames 60-63 user group 8 

10 flag 1 note 1 64 sync bit fixed”0” 
11 flag 2 note 2 65 sync bit fixed”0” 

12-15 user group 2 66 sync bit fixed” 1” 

16-19 units of seconds 67 sync bit fixed” 1” 

20-23 user group 3 68 sync bit fixed” 1” 

24-26 tens of seconds 69 sync bit fixed” 1” 
27 flag 3 note 3 70 sync bit fixed” 1” 

28-31 user group 4 71 sync bit fixed” 1” 

32-35 units of minutes 72 sync bit fixed” 1” 

36-39 user group 5 73 sync bit fixed” 1” 

40-42 tens of minutes 74 sync bit fixed” 1” 
43 flag 4 note 4 75 sync bit fixed” 1” 

44-47 user group 6 76 sync bit fixed” 1” 

48-51 units of hours 77 sync bit fixed” 1” 

52-55 user group 7 78 sync bit fixed”0” 

56-57 tens of hours 79 sync bit fixed” 1” 

Notes: 

1. Drop frame flag. 525-line and 1125-line systems: “1” if frame numbers are being dropped, “0” if no frame drop- 
ping is done. 625-line systems: “0.” 

2. Color frame flag. 525-line systems: “1” if even units of frame numbers identify fields 1 and 2 and odd units of 
field numbers identify fields 3 and 4. 625-line systems: “1” if timecode is locked to the video signal in accor- 
dance with 8-field sequence and the video signal has the “preferred subcarrier-to-line-sync phase.” 1125-line 
systems: “0.” 

3. 525-line and 1125-line systems: Phase correction. This bit shall be put in a state so that every 80-bit word con- 
tains an even number of “0”s. 625-line systems: Binary group flag 0. 

4, 525-line and 1125-line systems: Binary group flag 0. 625-line systems: Binary group flag 2. 

5. Binary group flag 1. 

6. 525-line and 1125-line systems: Binary group flag 2. 625-line systems: Phase correction. This bit shall be put 


in a state so that every 80-bit word contains an even number of “0”s. 


Table 8.20. LTC Bit Assignments. 
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Frames (count 0-29 for 525-line and 1125-line systems, 0-24 for 625-line systems) 


units of frames (bits 0-3) 4-bit BCD (count 0-9); bit 0 is LSB 

tens of frames (bits 8-9) 2-bit BCD (count 0-2); bit 8 is LSB 
Seconds 

units of seconds (bits 16-19) 4-bit BCD (count 0-9); bit 16 is LSB 

tens of seconds (bits 24—26) 3-bit BCD (count 0-5); bit 24 is LSB 
Minutes 

units of minutes (bits 32-35) 4-bit BCD (count 0-9); bit 32 is LSB 

tens of minutes (bits 40-42) 3-bit BCD (count 0-5); bit 40 is LSB 

Hours 
units of hours (bits 48-51) 4-bit BCD (count 0-9); bit 48 is LSB 
tens of hours (bits 56-57) 2-bit BCD (count 0-2); bit 56 is LSB 


Table 8.21. LTC Bit Arrangement. 


Figure 8.49. LTC Data Bit Transition Format. 
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The signal has a peak-to-peak amplitude of 0.5- 
4.5V, with rise and fall times of 40 +10 us (10% 
to 90% amplitude points). 

Because the entire frame time is used to 
generate the 80-bit LTC information, the bit 
rate (in bits per second) may be determined 
by: 


Fo = 80 Fy 


where Fy is the vertical frame rate in frames 
per second. The 80 bits of time code informa- 
tion are output serially, with bit 0 being first. 
The LTC word occupies the entire frame time, 
and the data must be evenly spaced through- 
out this time. The start of the LTC word occurs 
at the beginning of line 5 +1.5 lines for 525-line 
systems, at the beginning of line 2 +1.5 lines 
for 625-line systems, and at the vertical sync 
timing reference of the frame +1 line for 1125- 
line systems. 


Vertical Interval Time Code (VITC) 

The VITC is recorded during the vertical 
blanking interval of the video signal in both 
fields. Since it is recorded with the video, it can 
be read in still mode. However, it cannot be re- 
recorded (or restriped). Restriping requires 
dubbing down a generation, deleting and 
inserting a new time code. For YPbPr and S- 
video interfaces, VITC is present on the Y sig- 
nal. For analog RGB interfaces, VITC is 
present on all three signals. 

As with the LTC, the basic structure of the 
time data is based on the BCD system. Tables 
8.22 and 8.23 list the VITC bit assignments and 
arrangement. Note the 24-hour clock system is 
used. 


VITC Cyclic Redundancy Check 

Eight bits (82-89) are reserved for the code 
word for error detection by means of cyclic 
redundancy checking. The generating polyno- 


mial, x® 4 1, applies to all bits from 0 to 81, 
inclusive. Figure 8.50 illustrates implementing 
the polynomial using a shift register. During 
passage of timecode data, the multiplexer is in 
position 0 and the data is output while the CRC 
calculation is done simultaneously by the shift 
register. After all the timecode data has been 
output, the shift register contains the CRC 
value, and switching the multiplexer to posi- 
tion 1 enables the CRC value to be output. 
When the process is repeated on decoding, the 
shift register should contain all zeros if no 
errors exist. 


VITC Timing 

The modulation technique is such that each 
state corresponds to a binary state, and a tran- 
sition occurs only when there is a change in 
the data between adjacent bits from a “1” to “0” 
or “0” to “1.” No transitions occur when adja- 
cent bits contain the same data. This is com- 
monly referred to as “non-return to zero” 
(NRZ). Synchronization bit pairs are inserted 
throughout the VITC data to assist the receiver 
in maintaining the correct frequency lock. 

The bit rate (F(¢) is defined to be: 


Fo= 115 Fy + 2% 


where Fy is the horizontal line frequency. The 
90 bits of time code information are output 
serially, with bit 0 being first. For 625-line inter- 
laced systems, lines 19 and 332 are commonly 
used for the VITC. For 525-line interlaced sys- 
tems, lines 14 and 277 are commonly used. For 
1125-line interlaced systems, lines 9 and 571 
are commonly used. To protect the VITC 
against drop-outs, it may also be present two 
scan lines later, although any two nonconsecu- 
tive scan lines per field may be used. 

Figure 8.51 illustrates the timing of the 
VITC data on the scan line. The data must be 
evenly spaced throughout the VITC word. The 
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Bit(s) Function Note Bit(s) Function Note 

0 sync bit fixed “1” 42-45 units of minutes 

1 sync bit fixed “0” 46-49 user group 5 

2-5 units of frames 50 sync bit fixed “1” 

6-9 user group 1 51 sync bit fixed “0” 

10 sync bit fixed “1” 52-54 tens of minutes 

11 sync bit fixed “0” 55 flag 4 note 4 
12-13 tens of frames 56-59 user group 6 

14 flag 1 note 1 60 sync bit fixed “1” 

15 flag 2 note 2 61 sync bit fixed”0” 
16-19 user group 2 62-65 units of hours 

20 sync bit fixed “1” 66-69 user group 7 

21 sync bit fixed “0” 70 sync bit fixed” 1” 
22-25 units of seconds 71 sync bit fixed”0” 
26-29 user group 3 72-73 tens of hours 

30 sync bit fixed “1” 74 flag 5 note 5 

31 sync bit fixed “0” 75 flag 6 note 6 
32-34 tens of seconds 76-79 user group 8 

35 flag 3 note 3 80 sync bit fixed” 1” 
36-39 user group 4 81 sync bit fixed”0” 

40 sync bit fixed “1” 82-89 CRC group 

41 sync bit fixed “0” 

Notes: 


1. Drop frame flag. 525-line and 1125-line systems: “1” if frame numbers are being dropped, “0” if no frame 
dropping is done. 625-line systems: “0.” 

2. Color frame flag. 525-line systems: “1” if even units of frame numbers identify fields 1 and 2 and odd 
units of field numbers identify fields 3 and 4. 625-line systems: “1” if timecode is locked to the video sig- 
nal in accordance with 8-field sequence and the video signal has the “preferred subcarrier-to-line-sync 
phase.” 1125-line systems: “0.” 

3. 525-line systems: Field flag. “0” during fields 1 and 3, “1” during fields 2 and 4. 625-line systems: Binary 
group flag 0. 1125-line systems: Field flag. “0” during field 1, “1” during field 2. 

4, 525-line and 1125-line systems: Binary group flag 0. 625-line systems: Binary group flag 2. 

Binary group flag 1. 

6. 525-line and 1125-line systems: Binary group flag 2. 625-line systems: Field flag. “0” during fields 1, 3, 5, 
and 7, “1” during fields 2, 4, 6, and 8. 


on 


Table 8.22. VITC Bit Assignments. 


VBI Data 


327 


Frames (count 0-29 for 525-line and 1125-line systems, 0-24 for 625-line systems) 


units of frames (bits 2-5) 4-bit BCD (count 0-9); bit 2 is LSB 

tens of frames (bits 12-13) 2-bit BCD (count 0-2); bit 12 is LSB 
Seconds 

units of seconds (bits 22-25) 4-bit BCD (count 0-9); bit 22 is LSB 

tens of seconds (bits 32-34) 3-bit BCD (count 0-5); bit 32 is LSB 
Minutes 

units of minutes (bits 42-45) 4-bit BCD (count 0-9); bit 42 is LSB 

tens of minutes (bits 52-54) 3-bit BCD (count 0-5); bit 52 is LSB 

Hours 
units of hours (bits 62-65) 4-bit BCD (count 0-9); bit 62 is LSB 
tens of hours (bits 72-73) 2-bit BCD (count 0-2); bit 72 is LSB 


Table 8.23. VITC Bit Arrangement. 


DATA 0 
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Figure 8.50. VITC CRC Generation. 
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63.556 uS (115 BITS) 


50.286 uS (90 BITS) 


10 pS MIN 
~ — = —— = 2.1uS MIN 
(19 BITS) “ 
80 +10 
IRE 
J | 
525/59.94 SYSTEMS 
HSYNC HSYNC 
64 uS (115 BITS) 
j _ 
11.2 uS MIN 49.655 pS (90 BITS) 
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Figure 8.51. VITC Position and Timing. 
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User Bits Content | Timecode Referenced BGF2 BGF1 BGFO 
user defined no 0 0 0 
8-bit character set! no 0 0 1 
user defined yes 0 1 0 
reserved unassigned 0 1 1 
date and time zone® no 1 0 0 
page / line? no 1 0 1 
date and time zone® yes 1 1 0 
page / line? yes 1 1 1 


Notes: 


1. Conforming to ISO/IEC 646 or 2022. 
2. Described in SM PTE 262M. 
3. Described in SM PTE 309M. See Tables 8.25 through 8.27. 


Table 8.24. LTC and VITC Binary Group Flag (BGF) Bit Definitions. 


USER 
GROUPS 


Naw = 
aonAD 


SEP GEs SieIN es 


7-BIT ISO: B1 B2 B3 B4 B5 B6 B7 0 


8-BIT ISO: A1 A2 A3 A4 AS A6 A7 A8& 


ONE ISO CHARACTER 


~ _ 


Figure 8.52. Use of Binary Groups to Describe 
ISO Characters Coded With 7 or 8 Bits. 
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User Group 8 


User Group 7 


Bit 3 Bit2 | Bitl | BitO | Bit3 | Bit2 | Bit1 | BitoO 
MJD Flag 0 time zone offset code 00y-3F y 
Notes: 


1. MJD flag: “0” = YYMMDD format, “1” = MJD format. 


Table 8.25. Date and Time Zone Format Coding. 


Value Description 


0-9 day units 


0-3 day units 


0-9 month units 


0,1 month units 


0-9 year units 


is cap Assignment 
1 D 
2 D 
3 M 
4 M 
5 Y 
6 Y 


0-9 year units 


Table 8.26. YYMMDD Date Format. 


10% to 90% rise and fall times of the VITC bit 
data should be 200 +50 ns (525-line and 625- 
line systems) or 100 +25 ns (1125-line systems) 
before adding it to the video signal to avoid 
possible distortion of the VITC signal by down- 
stream chrominance circuits. In most circum- 
stances, the analog lowpass filters after the 
video D/A converters should suffice for the fil- 
tering. 


User Bits 

The binary group flag (BGF) bits shown in 
Table 8.24 specify the content of the 32 user 
bits. The 32 user bits are organized as eight 
groups of four bits each. 


The user bits are intended for storage of 
data by users. The 32 bits may be assigned in 
any manner without restriction, if indicated as 
user-defined by the binary group flags. 

If an 8bit character set conforming to 
ISO/IEC 646 or 2022 is indicated by the binary 
group flags, the characters are to be inserted 
as shown in Figure 8.52. Note that some user 
bits will be decoded before the binary group 
flags are decoded; therefore, the decoder must 
store the early user data before any processing 
is done. 
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Code Hours Code Hours Code Hours 
00 UTC 16 UTC + 10.00 2C UTC + 09.30 
01 UTC - 01.00 17 UTC + 09.00 2D UTC + 08.30 
02 UTC — 02.00 18 UTC + 08.00 2E UTC + 07.30 
03 UTC — 03.00 19 UTC + 07.00 2F UTC + 06.30 
04 UTC — 04.00 1A UTC — 06.30 30 TP-1 
05 UTC — 05.00 1B UTC - 07.30 31 TP-0 
06 UTC - 06.00 1C UTC — 08.30 32 UTC + 12.45 
07 UTC — 07.00 1D UTC — 09.30 33 reserved 
08 UTC — 08.00 1E UTC - 10.30 34 reserved 
09 UTC —- 09.00 1F UTC - 11.30 35 reserved 
AN UTC — 00.30 20 UTC + 06.00 36 reserved 
0B UTC - 01.30 21 UTC + 05.00 37 reserved 
0C UTC - 02.30 22 UTC + 04.00 38 user defined 
0D UTC — 03.30 23 UTC + 03.00 39 unknown 
OE UTC — 04.30 24 UTC + 02.00 3A UTC + 05.30 
OF UTC — 05.30 25 UTC + 01.00 3B UTC + 04.30 
10 UTC - 10.00 26 reserved 3C UTC + 03.30 
11 UTC - 11.00 27 reserved 3D UTC + 02.30 
12 UTC - 12.00 28 TP-3 3E UTC + 01.30 
13 UTC + 13.00 29 TP-2 3F UTC + 00.30 
14 UTC + 12.00 2A UTC + 11.30 
15 UTC + 11.00 2B UTC + 10.30 


Table 8.27. Time Zone Offset Codes. 
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When the user groups are used to transfer 
time zone and date information, user groups 7 
and 8 specify the time zone and the format of 
the date in the remaining six user groups, as 
shown in Tables 8.25 and 8.27. The date may 
be either a six-digit YYMMDD format (Table 
8.26) or a six-digit modified Julian date (MJD), 
as indicated by the MJD flag. 


Closed Captioning 


This section reviews closed captioning for the 
hearing impaired in the United States. Closed 
captioning and text are transmitted during the 
blanked active line-time portion of lines 21 and 
284. 

Extended data services (XDS) also may be 
transmitted during the blanked active line-time 


portion of line 284. XDS may indicate the pro- 
gram name, time into the show, time remain- 
ing to the end, and so on. 

Note that due to editing before transmis- 
sion, it may be possible that the caption infor- 
mation is occasionally moved down a scan line 
or two. Therefore, caption decoders should 
monitor more than just lines 21 and 284 for 
caption information. 


Waveform 

The data format for both lines consists of a 
clock run-in signal, a start bit, and two 7-bit 
plus parity words of ASCII data (per X3.4- 
1967). For YPbPr and S-video interfaces, cap- 
tioning is present on the Y signal. For analog 
RGB interfaces, captioning is present on all 
three signals. 


10.5+0.25 pS 12.91 us 
Ln rales 
7 CYCLES TWO 7-BIT + PARITY 
OF 0.5035 MHZ ASCII CHARACTERS 
(CLOCK RUN-IN) (DATA) Saesacine 
i RISE / FALL 
3.58 MHZ s Do-D6 P DO-D6 p |— TIMES 
COLOR BURST Tf: A A (2T BAR 
60 <ac IRE (9 CYCLES) A R R SHAPING) 
R I } 
T T T 
Y Y 
BLANK LEVEL | —— 
40 IRE 
SYNC LeveL —! 
10.003 + 0.25 pS 
L. -\~ | 
27.382 US 33.764 US 


Figure 8.53. 525-Line Lines 21 and 284 Closed Captioning Timing. 
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Figure 8.53 illustrates the waveform and 
timing for transmitting the closed captioning 
and XDS information and conforms to the Tele- 
vision Synchronizing Waveform for Color 
Transmission in Subpart E, Part 73 of the FCC 
Rules and Regulations and EJA-608. The clock 
run-in is a 7-cycle sinusoidal burst that is fre- 
quency-locked and phase-locked to the caption 
data and is used to provide synchronization for 
the decoder. The nominal data rate is 32x Fy. 
However, decoders should not rely on this tim- 
ing relationship due to possible horizontal tim- 
ing variations introduced by video processing 
circuitry and VCRs. After the clock run-in sig- 
nal, the blanking level is maintained for a two 
data bit duration, followed by a “1” start bit. 
The start bit is followed by 16 bits of data, com- 
posed of two 7-bit + odd parity ASCII charac- 
ters. Caption data is transmitted using a non- 
return-to-zero (NRZ) code; a “1” corresponds 
to the 50 + 2 IRE level and a “0” corresponds to 
the blanking level (0-2 IRE). The negative- 
going crossings of the clock are coherent with 
the data bit transitions. 

Typical decoders specify the time between 
the 50% points of sync and clock run-in to be 
10.5 +0.5 us, with a +3% tolerance on Fy, 50 +12 
IRE for a “1” bit, and -2 to +12 IRE for a “0” bit. 
Decoders must also handle bit rise/fall times 
of 240-480 ns. 

NUL characters (004) should be sent 
when no display or control characters are 
being transmitted. This, in combination with 
the clock run-in, enables the decoder to deter- 
mine whether captioning or text transmission 
is being implemented. 

If using only line 21, the clock run-in and 
data do not need to be present on line 284. 
However, if using only line 284, the clock run- 
in and data should be present on both lines 21 
and 284; data for line 21 would consist of NUL 
characters. 


At the decoder, as shown in Figure 8.54, 
the display area of a 525-line 4:3 interlaced dis- 
play is typically 15 rows high and 34 columns 
wide. The vertical display area begins on lines 
43 and 306 and ends on lines 237 and 500. The 
horizontal display area begins 13 us and ends 
58 us, after the leading edge of horizontal sync. 

In text mode, all rows are used to display 
text; each row contains a maximum of 32 char- 
acters, with at least a one-column wide space 
on the left and right of the text. The only trans- 
parent area is around the outside of the text 
area. 

In caption mode, text usually appears only 
on rows 1-4 or 12-15; the remaining rows are 
usually transparent. Each row contains a maxi- 
mum of 32 characters, with at least a one-col- 
umn wide space on the left and right of the 
text. 

Some caption decoders support up to 48 
columns per row, and up to 16 rows, allowing 
some customization for the display of caption 
data. 


Basic Services 

There are two types of display formats: text 
and captioning. In understanding the operation 
of the decoder, it is easier to visualize an invisi- 
ble cursor that marks the position where the 
next character will be displayed. Note that if 
you are designing a decoder, you should obtain 
the latest FCC Rules and Regulations and EIA- 
608 to ensure correct operation, as this section 
is only asummary. 


Text Mode 

Text mode uses 7-15 rows of the display and is 
enabled upon receipt of the Resume Text Dis- 
play or Text Restart code. When text mode has 
been selected, and the text memory is empty, 
the cursor starts at the top-most row, character 
1 position. Once all the rows of text are dis- 
played, scrolling is enabled. 
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With each carriage return received, the 
top-most row of text is erased, the text is rolled 
up one row (over a maximum time of 0.433 sec- 
onds), the bottom row is erased, and the cur- 
sor is moved to the bottom row, character 1 
position. If new text is received while scrolling, 
it is seen scrolling up from the bottom of the 
display area. If a carriage return is received 
while scrolling, the rows are immediately 
moved up one row to their final position. 

Once the cursor moves to the character 32 
position on any row, any text received before a 
carriage return, preamble address code, or 
backspace will be displayed at the character 32 
position, replacing any previous character at 
that position. The Text Restart command 
erases all characters on the display and moves 
the cursor to the top row, character 1 position. 


Captioning Mode 
Captioning has several modes available, includ- 
ing roll-up, pop-on, and paint-on. 

Roll-up captioning is enabled by receiving 
one of the miscellaneous control codes to 
select the number of rows displayed. “Roll-up 
captions, 2 rows” enables rows 14 and 15; “roll- 
up captions, 3 rows” enables rows 13-15, “roll- 
up captions, 4 rows” enables rows 12-15. 
Regardless of the number of rows enabled, the 
cursor remains on row 15. Once row 15 is full, 
the rows are scrolled up one row (at the rate of 
one dot per frame), and the cursor is moved 
back to row 15, character 1. 

Pop-on captioning may use rows 1-4 or 12- 
15, and is initiated by the Resume Caption 
Loading command. The display memory is 
essentially double-buffered. While memory 
buffer 1 is displayed, memory buffer 2 is being 
loaded with caption data. At the receipt of a 


LINE COUNT 


45.02 pS (34 CHARACTERS) 


LINE 43 


ROW 1 


CAPTIONS 
OR 
INFOTEXT 


INFOTEXT 
ONLY 


CAPTIONS 
OR 
INFOTEXT 


Figure 8.54. Closed Captioning Display Format. 
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End of Caption code, memory buffer 2 is dis- 
played while memory buffer 1 is being loaded 
with new caption data. 

Paint-on captioning, enabled by the 
Resume Direct Captioning command, is simi- 
lar to Pop-on captioning, but no double-buffer- 
ing is used; caption data is loaded directly into 
display memory. 

Three types of control codes (preamble 
address codes, midrow codes, and miscella- 
neous control codes) are used to specify the 
format, location, and attributes of the charac- 
ters. Each control code consists of two bytes, 
transmitted together on line 21 or line 284. On 
line 21, they are normally transmitted twice in 
succession to help ensure correct reception. 
They are not transmitted twice on line 284 to 
minimize bandwidth used for captioning. 

The first byte is a nondisplay control byte 
with a range of 10y to 1Fy; the second byte is a 
display control byte in the range of 20y to 7Fy. 
At the beginning of each row, a control code is 
sent to initialize the row. Caption roll-up and 
text modes allow either a preamble address 


PREAMBLE CONTROL CODE 
(TRANSMITTED TWICE) 


code or midrow control code at the start of a 
row; the other caption modes use a preamble 
address code to initialize a row. The preamble 
address codes are illustrated in Figure 8.55 
and Table 8.28. 

The midrow codes are typically used 
within a row to change the color, italics, under- 
line, and flashing attributes and should occur 
only between words. Color, italics, and under- 
line are controlled by the preamble address 
and midrow codes; flash on is controlled by a 
miscellaneous control code. An _ attribute 
remains in effect until another control code is 
received or the end of row is reached. Each 
row starts with a control code to set the color 
and underline attributes (white nonunderlined 
is the default if no control code is received 
before the first character on an empty row). 
The color attribute can be changed only by the 
midrow code of another color; the italics 
attribute does not change the color attribute. 
However, a color attribute turns off the italics 
attribute. The flash on command does not alter 
the status of the color, italics, or underline 


CAPTION TEXT UP TO 32 
CHARACTERS PER ROW 


NON-DISPLAY 
CONTROL ODD 
CHARACTER | PARITY 
(7 BITS BIT 
LSB FIRST) 


DISPLAY 
CONTROL ODD 
CHARACTER | PARITY 
(7 BITS BIT 
LSB FIRST) 


START 
BIT 


FIRST SECOND 
START TEXT ODD TEXT ODD 
BIT CHARACTER | PARITY | CHARACTER | PARITY 
(7 BITS BIT (7 BITS BIT 


LSB FIRST) LSB FIRST) 


IDENTIFICATION CODE, ROW POSITION, INDENT, 
AND DISPLAY CONDITION INSTRUCTIONS 


BEGINNING OF DISPLAYED 
CAPTION 


Figure 8.55. Closed Captioning Preamble Address Code Format. 
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Non-display Control Byte Display Control Byte 
Row Position 
D6 | D5 | D4 | D3 | D2 | D1 | DO | D6 | D5 | D4 | D3 | D2 | D1 | DO 
1 0 1 
0 0 1 
1 1 2 
1 0 3 
0 1 0 
1 1 4 
1 0 +) 
1 0 1 
1 1 6 
1 1 0 : 2 i 
0 0 1 CH 1 1 A B Cc D U 8 
1 0 9 
1 1 1 
1 1 10 
0 0 0 1 0 11 
1 0 12 
0 1 1 
1 1 13 
1 0 14 
1 0 0 
1 1 15 


Notes: 
1. U: “0” = no underline, “1” = underline. 
2. CH: “0” = data channel 1, “1” = data channel 2. 


Attribute 


white 


green 
blue 
cyan 


red 


yellow 


magenta 
italics 
indent 0, white 


indent 4, white 
indent 8, white 
indent 12, white 
indent 16, white 
indent 20, white 
indent 24, white 
indent 28, white 


RR TRL RL RL RILRILRILOLOILOILOIOIOIO/IOC!] B&B 


Ble ]LRILR IL OLOlLOlLOlRIRIRIRILOlLoOlol/o!] Ww 
BL RI OLO]TRIRIOlLOlRIRIOloOl[RIRIoOlo| oO 
RILOL RIL OLR I OLRIO]LRIOIRICIRICIRIC!] UO 


Table 8.28. Closed Captioning Preamble Address Codes. In text mode, the indent codes 
may be used to perform indentation; in this instance, the row information is ignored. 
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attributes. However, a color or italics midrow 
control code turns off the flash. Note that the 
underline color is the same color as the charac- 
ter being underlined; the underline resides on 
dot row 11 and covers the entire width of the 
character column. 

Table 8.29, Figure 8.56, and Table 8.30 
illustrate the midrow and miscellaneous con- 
trol code operation. For example, if it were the 
end of a caption, the control code could be End 
of Caption (transmitted twice). It could be fol- 
lowed by a preamble address code (transmit- 
ted twice) to start another line of captioning. 

Characters are displayed using a dot 
matrix format. Each character cell is typically 
16 samples wide and 26 samples high (16 x 
26), as shown in Figure 8.57. Dot rows 2-19 
are usually used for actual character outlines. 
Dot rows 0, 1, 20, 21, 24, and 25 are usually 
blanked to provide vertical spacing between 
characters, and underlining is typically done 


on dot rows 22 and 23. Dot columns 0, 1, 14 
and 15 are blanked to provide horizontal spac- 
ing between characters, except on dot rows 22 
and 23 when the underline is displayed. This 
results in 12 x 18 characters stored in charac- 
ter ROM. Table 8.31 shows the basic character 
set. 

Some caption decoders support multiple 
character sizes within the 16 x 26 region, 
including 13 x 16, 13 x 24, 12 x 20, and 12 x 26. 
Not all combinations generate a sensible result 
due to the limited display area available. 


Optional Captioning Features 
Three sets of optional features are available for 
advanced captioning decoders. 


Optional Attributes 

Additional color choices are available for 
advanced captioning decoders, as shown in 
Table 8.32. 


Non-display Control Byte 


Display Control Byte 


D6 | D5 | D4 | D3 | D2 | D1 | DO | D6 | DS 


Attribute 
D4 


~] 
w 
iw] 
iS) 
i] 
Bp 


DO 


white 


green 
blue 
cyan 


red 


yellow 


magenta 
italics 


Ble |E|olololo 

BLE ololRlHRlolo 

BlLOolRlolRlolHilo 
q 


Notes: 
1. U: “0” = no underline, “1” = underline. 
2. CH: “0” = data channel 1, “1” = data channel 2. 


3. Italics is implemented as a two-dot slant to the right over the vertical range of the character. Some 
decoders implement a one dot slant for every four scan lines. Underline resides on dot rows 22 and 


23, and covers the entire column width. 


Table 8.29. Closed Captioning Midrow Codes. 
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TEXT MID-ROW CONTROL CODE 
(TRANSMITTED TWICE) 


—— | 
TEAL ODD eee ODD panes i ODD pene) {o}p)p) 
START | CHARACTER CHARACTER START 
PARITY PARITY CHARACTER | PARITY | CHARACTER | PARITY 
a (7 BITS BIT (BIS BIT ey (7 BITS BIT (7 BITS BIT 
LSB FIRST) LSB FIRST) 
LSB FIRST) LSB FIRST) 


Figure 8.56. Closed Captioning Midrow Code Format. Miscellaneous control codes may also be 
transmitted in place of the midrow control code. 


Non-display Control Byte Display Control Byte 


Command 
D6 | D5 | D4 | D3 | D2 | D1 | DO | D6 | D5 | D4 


1] 
w 
i] 
i) 
i] 
B 
~] 
° 


resume caption loading 


backspace 


reserved 


reserved 


delete to end of row 


roll-up captions, 2 rows 


roll-up captions, 3 rows 


roll-up captions, 4 rows 


flash on 


resume direct captioning 


text restart 


resume text display 


erase displayed memory 


carriage return 


erase nondisplayed memory 


end of caption (flip memories) 


tab offset (1 column) 


tab offset (2 columns) 


SIL Ol Ol RyRy RPlRelRPlRelReylR|lololaolaoalol/olo!|co 
SIL Ol Ol Ry Rel_RlRPloOl Ol OlOl Rl RelRelRlololol!lo 
RL RL OLR RIL OlOl RI RIL OloOlRIRlololRIi]Rlolo 
MLO RIL RLOl RI OL RIL OlRIOlRILOlRI]olRlolRilo 


tab offset (8 columns) 


Notes: 
1. F: “0” = line 21, “1” = line 284. CH: “0” = data channel 1, “1” = data channel 2. 
2. “Flash on” blanks associated characters for 0.25 seconds once per second. 


Table 8.30. Closed Captioning Miscellaneous Control Codes. 
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LINE 43 O-O0-0-0-0-0-0-0-0-0-0-0-0-0-0-0 0 

LINE 306 0-0-0-0-0-0-0-0-0-0-0-00-00-0 

LINE 44 00@@000000008800 2 

LINE 307 OOPPBOOCOOOOCCBB0OO 

LINE 45 OOS S@SBOOOCCCCCBOO 4 © BLANK DOT 
LINE 308 OO@@SOCBSOOCCCCCBOO 

LINE 46 OO0@@OCCOSCCOBOCOOO 6 

LINE 309 OOSPOCOCSCCBOOeCOCOO 

LINE 47 OOS BOO0CCBOOOCeCe@0OO 8 @ CHARACTER DOT 
LINE 310 OOPBPOOOCeCBOOOCeCB0OO 

LINE 48 0-0@@00000000 8800 10 

LINE 311 OOPeBOOOCOOCOC0O0 C800 

LINE 49 00@@00000000 8800 12 

LINE 312 OO@eBOOCOO00O00 e800 

LINE 50 00@@00000000 8800 14 

LINE 313 OOPeBPOOOCOOCO0O0C0 C800 

LINE 51 00@@00000000 80800 16 

LINE 314 OOP BOOOCOO00O00 e800 

LINE 52 00@@00000000 86800 18 

LINE 315 OOPeBOCOCOOCO0O0 Ce 00 

LINE 53 O-0-0-0-0-0-0-0-0-0-0-0-0-0-0-0 20 

LINE 316 O=0-0-0-0-0-0-0-0-0-0-0-0-0-0-0 

LINE 54 @-@-© © © © 0 0 006 0060060 0 22 UNDERLINE 
LINE 317 @-@-@ 0 @ 6 0 6 0 0008008 0 

LINE 55 O-0-0-0-0-0-0-0-0-0-0-0-0-0-0-0 24 

LINE 318 0O-0-0-0-0-0-0-0-0-0-0-0-0-0-0-0 


Figure 8.57. Typical 16x26 Closed Captioning Character Cell Format for Row 1. 
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Nondisplay Control Byte Display Control Byte Special 
D6 | D5 | D4 | D3 | D2 | D1 | DO | Dé | DS | D4 | D3 | D2 | D1 | DO Characters 
0 0 0 0 ® 
0 0 0 1 ? 
0 0 1 0 1/2 
0 0 1 1 é 
0 1 0 0 a 
(0) 1 0 1 ¢ 
0 1 1 0 £ 
0 1 1 1 music note 
0 0 1 | CH} 0 0 1 0 1 1 
1 0 0 0 a 
1 0 0 1 transparent space 
1 0 1 0 e 
1 0 1 1 a 
1 1 0 0 é 
1 1 0 1 i 
1 1 1 0 6 
1 1 1 1 a 
=) 
a 
+ ° a ° a ° a ° a ° a ° a 
a ° ° a a =) ° a a ° ° a a 
9 a a a a } } ° ° a a a a 
a ° ° i) i) a a a a a a a a 
© 
a 
D2 D1 DO 
000 ( 0 8 @ H P X u h p x 
001 ! ) 1 A I Q Y a i y 
010 a a 2 B J R Z b j r 
O11 # + 3 : Cc K S [ c k Ss ¢ 
100 $ i 4 < D L T é d 1 t + 
101 % - 5 = E M U ] e m u N 
110 & : 6 > F N Vv i f n v fi 
111 : / 7 G oO} W 6 g (0) w 


Table 8.31. Closed Captioning Basic Character Set. 
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Non-display Control Byte 


Display Control Byte 


D6 | D5 | D4 | D3 | D2 | D1 | DO | D6 | DS 


Background 


D4 Attribute 


1] 
w 
i] 
iS) 
i] 
Bp 


DO 


CH} 0 0 0 0 1 


white 


green 
blue 


cyan 


red 


yellow 


magenta 
black 


oO 
Rl RRR Re] Oolololo 
Rl RLRILoOlLOolRlRlolo 
Sol RlOlRI|OoOlRlolRIlo 


CH 1 1 1 0 1 


1 transparent 


D6 | D5 | D4 | D3 | D2 | D1 | DO | D6 | DS 


D4 Do Foreground 


Attribute 


1] 
wo 
i] 
nN 
i] 
pB 


CH 1 1 1 0 1 


0 black 
1 black underline 


Notes: 
1. F: “0” = opaque, “1” = semi-transparent. 
2. CH: “0” = data channel 1, “1” = data channel 2. 


3. Underline resides on dot rows 22 and 23, and covers the entire column width. 


Table 8.32. Closed Captioning Optional Attribute Codes. 


If a decoder doesn’t support semitranspar- 
ent colors, the opaque colors may be used 
instead. If a specific background color isn’t 
supported by a decoder, it should default to the 
black background color. However, if the black 
foreground color is supported in a decoder, all 
the background colors should be imple- 
mented. 

A background attribute appears as a stan- 
dard space on the display, and the attribute 
remains in effect until the end of the row or 
until another background attribute is received. 

The foreground attributes provide an 
eighth color (black) as a character color. As 
with midrow codes, a foreground attribute 
code turns off italics and blinking, and the 
least significant bit controls underlining. 


Background and foreground attribute 
codes have an automatic backspace for back- 
ward compatibility with current decoders. 
Thus, an attribute must be preceded by a stan- 
dard space character. Standard decoders dis- 
play the space and ignore the attribute. 
Extended decoders display the space, and on 
receiving the attribute, backspace, then display 
a space that changes the color and opacity. 
Thus, text formatting remains the same 
regardless of the type of decoder. 


Optional Closed Group Extensions 

To support custom features and characters not 
defined by the standards, the EIA/CEG main- 
tains a set of code assignments requested by 
various caption providers and decoder manu- 
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facturers. These code assignments (currently 
used to select various character sets) are not 
compatible with caption decoders in the United 
States and videos using them should not be 
distributed in the U. S. market. 

Closed group extensions require two 
bytes. Table 8.33 lists the currently assigned 
closed group extensions to support captioning 
in the Asian languages. 


Optional Extended Characters 

An additional 64 accented characters (eight 
character sets of eight characters each) may 
be supported by decoders, permitting the dis- 
play of other languages such as Spanish, 
French, Portuguese, German, Danish, Italian, 
Finnish, and Swedish. If supported, these 
accented characters are available in all caption 
and text modes. 


Each of the extended characters incorpo- 
rates an automatic backspace for backward 
compatibility with current decoders. Thus, an 
extended character must be preceded by the 
standard ASCII version of the character. Stan- 
dard decoders display the ASCII character and 
ignore the accented character. Extended 
decoders display the ASCII character, and on 
receiving the accented character, backspace, 
then display the accented character. Thus, text 
formatting remains the same regardless of the 
type of decoder. 

Extended characters require two bytes. 
The first byte is 12y or 13y for data channel 
one (1Ay or 1By for data channel two), fol- 
lowed by a value of 209-3F y. 


Non-display Control Byte Display Control Byte 
Background 
pe | D5 | D4 | D3 | D2 | p1 | Do | pe | ps5 | pa | v3 | v2 | pa | vo Aceribute 
standard character 
0 1 0 0 set (normal size) 
standard character 
a 1 0 | set (double size) 
first private 
0 : 1 7 character set 
0 1 1 1 second private 
0 0 1 CH 1 1 1 0 1 0 character set 
People’s Republic 
1 0 0 (0) of China character 
set (GB 2312) 
Korean Standard 
1 0 0 1 character set 
(KSC 5601-1987) 
1 0 1 (0) first registered character set 
Notes: 


1. CH: “0” = data channel 1, “1” = data channel 2. 


Table 8.33. Closed Captioning Optional Closed Group Extensions. 


VBI Data 343 


Extended Data Services 

Line 284 may contain extended data service 
information, interleaved with the caption and 
text information, as bandwidth is available. In 
this case, control codes are not transmitted 
twice, as they may be for the caption and text 
services. 

Information is transmitted as packets and 
operates as a separate unique data channel. 
Data for each packet may or may not be contig- 
uous and may be separated into subpackets 
that can be inserted anywhere space is avail- 
able in the line 284 information stream. 

There are four types of extended data char- 
acters: 


Control: Control characters are used as a 
mode switch to enable the extended data 
mode. They are the first character of two and 
have a value of 01p to OF y. 


Type: Type characters follow the control 
character (thus, they are the second character 
of two) and identify the packet type. They have 
a value of 01p to OF y. 


Checksum: Checksum characters always 
follow the “end of packet” control character. 
Thus, they are the second character of two and 
have a value of 00g to 7Fy. 


Informational: These characters may be 
ASCII or non-ASCII data. They are transmitted 
in pairs up to and including 32 characters. A 
NUL character (00}) is used to ensure pairs of 
characters are always sent. 


Control Characters 

Table 8.34 lists the control codes. The current 
class describes a program currently being 
transmitted. The future class describes a pro- 


gram to be transmitted later. It contains the 
same information and formats as the current 
class. The channel class describes non-pro- 
gram-specific information about the channel. 
The miscellaneous class describes miscella- 
neous information. The public class transmits 
data or messages of a public service nature. 
The undefined class is used in proprietary sys- 
tems for whatever that system wishes. 


Type Characters (Current, Future Class) 


Program Identification Number (01y) 
This packet uses four characters to specify the 
program start time and date relative to Coordi- 
nated Universal Time (UTC). The format is 
shown in Table 8.35. 

Minutes have a range of 0-59. Hours have 
a range of 0-23. Dates have a range of 1-31. 
Months have a range of 1-12. “T” indicates if a 
program is routinely tape delayed for the 
Mountain and Pacific time zones. The “D,” “L,” 
and “Z” bits are ignored by the decoder. 


Program Length (024) 
This packet has 2, 4, or 6 characters and indi- 
cates the scheduled length of the program and 
elapsed time for the program. The format is 
shown in Table 8.36. 

Minutes and seconds have a range of 0-59. 
Hours have a range of 0-63. 


Program Name (03) 
This packet contains 2-32 ASCII characters 
that specify the title of the program. 


Program Type (044) 

This packet contains 2-32 characters that spec- 
ify the type of program. Each character is 
assigned a keyword, as shown in Table 8.37. 
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dina Function Class 
oat are current 
024 continue 
034 start 
044 continue future 
054 start 
064 continue channel 
OTH start a 
084 continue miscellaneous 
oat Se public service 

H 

0By start 
0Cy continue reserved 
0Dy start 
0Ey continue undefined 
OFy end all 


Table 8.34. EIA-608 Control Codes. 


D6 | D5 | D4 | D3 | D2 | D1 | DO Character 
1 | m5 | m4 | m3 | m2 |} ml | m0 | minute 
1 D | h4 | h3 | h2 | hi | hO | hour 
1 d4 | d3 | d2 | dl | dO | date 
1 Z T | m3} m2 | ml | m0 | month 


Table 8.35. EIA-608 Program Identification Number Format. 
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Character 


length, minute 


length, hour 


elapsed time, minute 


elapsed time, hour 


elapsed time, second 


COlR]RPtRI]ReiR 
i” 
on 
im” 
cs 
i" 
ow 
i” 
bo 
i" 
= 
im” 
oO 


null character 


Table 8.36. EIA-608 Program Length Format. 


eae Keyword oe Keyword nes Keyword 
20 education 30 business 40 fantasy 
21 entertainment 31 classical 41 farm 
22 movie 32 college 42 fashion 
23 news 33 combat 43 fiction 
24 religious 34 comedy 44 food 
25 sports 35 commentary 45 football 
26 other 36 concert 46 foreign 
27 action 37 consumer 47 fund raiser 
28 advertisement 38 contemporary 48 game/quiz 
29 animated 39 crime 49 garden 
2A anthology 3A dance 4A golf 
2B automobile 3B documentary 4B government 
2C awards 3C drama 4C health 
2D baseball 3D elementary 4D high school 
2E basketball 3E erotica 4E history 
2F bulletin 3F exercise 4F hobby 
50 hockey 60 music 70 romance 
51 home 61 mystery 71 science 
52 horror 62 national 72 series 
53 information 63 nature 73 service 
54 instruction 64 police 74 shopping 
55 international 65 politics 75 soap opera 
56 interview 66 premiere 76 special 
57 language 67 prerecorded 77 suspense 
58 legal 68 product 78 talk 
59 live 69 professional 79 technical 
5A local 6A public 7A tennis 
5B math 6B racing 7B travel 
5C medical 6C reading 7C variety 
5D meeting 6D repair 7D video 
5E military 6E repeat 7E weather 
oF miniseries 6F review 7F western 


Table 8.37. EIA-608 Program Types. 
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Program Rating (05) 

This packet, commonly referred to regarding 
the ”v” chip, contains the information shown in 
Table 8.38 to indicate the program rating. 

V indicates if violence is present. S indi- 
cates if sexual situations are present. L indi- 
cates if adult language is present. D indicates if 
sexually suggestive dialog is present. 


Program Audio Services (06}) 

This packet contains two characters as shown 
in Table 8.39 to indicate the program audio ser- 
vices available. 


Program Caption Services (07) 

This packet contains 2-8 characters as shown 
in Table 8.40 to indicate the program caption 
services available. L2-LO are coded as shown 
in Table 8.39. 


Copy Generation Management System (08) 
This CGMS-A (Copy Generation Management 
System—Analog) packet contains 2 charac- 
ters as shown in Table 8.41. 

In the case where either B3 or B4 is a “0,” 
there is no Analog Protection System (B1 and 
B2 are “0”). BO is the analog source bit. 


Program Aspect Ratio (094) 

This packet contains two or four characters as 
shown in Table 8.42 to indicate the aspect ratio 
of the program. 

SO-S5 specify the first line containing 
active picture information. The value of SO-S5 
is calculated by subtracting 22 from the first 
line containing active picture information. The 
valid range for the first line containing active 
picture information is 22-85. 

E(Q-E5 specify the last line containing 
active picture information. The last line con- 
taining active video is calculated by subtracting 
the value of EO-E5 from 262. The valid range 


for the last line containing active picture infor- 
mation is 199-262. 

When this packet contains all zeros for 
both characters, or the packet is not detected, 
an aspect ratio of 4:3 is assumed. 

The QO bit specifies whether the video is 
squeezed (“1”) or normal (“0”). Squeezed 
video (anamorphic) is the result of compress- 
ing a 16:9 aspect ratio picture into a 4:3 aspect 
ratio picture without cropping side panels. 

The aspect ratio is calculated as follows: 


320/ (E-S):1 


Program Description (10q-174) 
This packet contains 1-8 packet rows, with 
each packet row containing 0-32 ASCII charac- 
ters. A packet row corresponds to a line of text 
on the display. 

Each packet is used in numerical 
sequence, and if a packet contains no ASCII 
characters, a blank line will be displayed. 


Type Characters (Channel Class) 


Network Name (01y) 
This packet uses 2-32 ASCII characters to 
specify the network name. 


Network Call Letters (02) 

This packet uses four or six ASCII characters 
to specify the call letters of the channel. When 
six characters are used, they reflect the over- 
the-air channel number (2-69) assigned by the 
FCC. Single-digit channel numbers are pre- 
ceded by a zero or a null character. 


Channel Tape Delay (03) 

This packet uses two characters to specify the 
number of hours and minutes the local station 
typically delays network programs. The format 
of this packet is shown in Table 8.43. 
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D6 D5 D4 D3 D2 | D1 | DO Character 
1 |D/a2| al a0 r2 rl r0 MPAA movie rating 
1 V S |L/a3| g2 | gl | gO | TVrating 
12-10: Movie Rating g2-20: USATV Rating a3-a0: xxx0 MPAA movie rating 
000 not applicable 000 ~=not rated LDO1 USATYV rating 
001 #G 001 TVv-Y 0011 Canadian English TV rating 
010 PG 010 TV-Y7 0111 Canadian French TV rating 
011 PG-13 011 TV-G 1011 reserved 
100 R 100 TV-PG 1111 reserved 
101. NC-17 101. TV-14 
110 Xx 110 TV-MA 
111 not rated 111 not rated 
g2-g0: Canadian English TV Rating g2-g0: Canadian French TV Rating 
000 exempt 000 exempt 
001 oC 001 =#G 
010 C8&+ 010 8ans+ 
011°=#G 011. 13 ans+ 
100 PG 100 16ans+ 
101. 14+ 101. 18 ans+ 
110) «18+ 110 reserved 
111 reserved 111s reserved 
Table 8.38. EIA-608 and EIA-744 Program Rating Format. 
D6 | D5 | D4 D3 | D2 | D1 | DO Character 
1 L2 | Ll | LO | T2 | T1 | TO | main audio program 
1 L2 | Li | LO | S2 | S1 | SO second audio program (SAP) 
L2-L0: 000 unknown T2-T0: 000 unknown S2-S0: 000 unknown 
001 ~~ english 001 mono 001 mono 
010 ~— spanish 010 ~— simulated stereo 010 video descriptions 
011 ~— french 011 ‘true stereo 011 ~—non-program audio 
100 german 100 stereo surround 100 special effects 
101 italian 101 data service 101 data service 
110 other 110 other 110 other 
111 = none 111 none 111 ~~ none 


Table 8.39. EIA-608 Program Audio Services Format. 
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D6 | D5 | D4 D3 | D2 | D1 | DO Character 


1 L2 | Ll | LO F Cc T service code 


FCT: 000 line 21, data channel 1 captioning 
001 line 21, data channel 1 text 
010 line 21, data channel 2 captioning 
011 line 21, data channel 2 text 
100 line 284, data channel 1 captioning 
101 line 284, data channel 1 text 
110 line 284, data channel 2 captioning 
111 line 284, data channel 2 text 


Table 8.40. EIA-608 Program Caption Services Format. 


D6 | D5 | D4 D3 | D2 | D1 | DO Character 


1 0 | B4 | B3 | B2 |} Bl | BO | CGMS 


0 0 0 0 0 0 0 null 


B4-B3 CGMS-A Services: 


00 copying permitted without restriction 
01 condition not to be used 

10 one generation copy allowed 

11 no copying permitted 


B2-B1 Analog Protection Services: 
00 no pseudo-sync pulse 
01 pseudo-sync pulse on; color striping off 


10 pseudo-sync pulse on; 2-line color striping on 
11 pseudo-sync pulse on; 4-line color striping on 


Table 8.41. EIA-608 and EIA IS-702 CGMS-A Format. 
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D6 | D5 | D4 | D3 | D2 | D1 | DO Character 

1 S5 | S4 |} S3 | S2 | S1 | SO start 

1 ES | E4 | E3 | E2 | El | EO end 

1 = & - - - QO | other 

0 0 0 0 0 0 0 null 

Table 8.42. EIA-608 Program Aspect Ratio Format. 
D6 | D5 | D4 | D3 | D2 | D1 | DO Character 

a} m5 | m4 | m3 |} m2 | mil | m0 minute 

1 - h4 | h3 | h2 | hi | ho hour 

Table 8.43. EIA-608 Channel Tape Delay Format. 

D6 | D5 | D4 | D3 | D2 | D1 | DO Character 

1 m5 | m4 | m3 |} m2 | mil | m0 minute 

1 D h4 | h3 | h2 | hl | hO hour 

1 L d4 | d3 | d2 |} di | do date 

1 Z T | m3} m2] mi | m0] month 

1 - - - D2 | D1 | DO day 

1 YS | Y4 | Y3 |} Y2 | Y1 | YO year 

Table 8.44. EIA-608 Time of Day Format. 
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Minutes have a range of 0-59. Hours have 
a range of 0-23. This delay applies to all pro- 
grams on the channel that have the “T” bit set 
in their Program ID packet (Table 8.35). 


Type Characters (Miscellaneous Class) 


Time of Day (01y) 

This packet uses six characters to specify the 
current time of day, month, and date relative to 
Coordinated Universal Time (UTC). The for- 
mat is shown in Table 8.44. 

Minutes have a range of 0-59. Hours have 
a range of 0-23. Dates have a range of 1-31. 
Months have a range of 1-12. Days have a 
range of 1 (Sunday) to 7 (Saturday). Years 
have a range of 0-63 (added to 1990). 

“T” indicates if a program is routinely tape 
delayed for the Mountain and Pacific time 
zones. “D” indicates whether daylight savings 
time currently is being observed. “L” indicates 
whether the local day is February 28th or 29th 
when it is March 1st UTC. “Z” indicates 
whether the seconds should be set to zero (to 
allow calibration without having to transmit the 
full 6 bits of seconds data). 


Impulse Capture ID (02y) 

This packet carries the program start time and 
length, and can be used to tell a VCR to record 
this program. The format is shown in Table 
8.45. 

Start and length minutes have a range of 
0-59. Start hours have a range of 0-23; length 
hours have a range of 0-63. Dates have a range 
of 1-31. Months have a range of 1-12. “T” indi- 
cates if a program is routinely tape delayed for 
the Mountain and Pacific time zones. The “D,” 
“L.” and “Z” bits are ignored by the decoder. 


Supplemental Data Location (03) 
This packet uses 2-32 characters to specify 
other lines where additional VBI data may be 
found. Table 8.46 shows the format. 

“F” indicates field one (“0”) or field two 
(“1”). N may have a value of 7-31, and indi- 
cates a specific line number. 


Local Time Zone (044) 

This packet uses two characters to specify the 
viewer time zone and whether the locality 
observes daylight savings time. The format is 
shown in Table 8.47. 

Hours have a range of 0-23. This is the 
nominal time zone offset, in hours, relative to 
UTC. “D” is a “1” when the area is using day- 
light savings time. 


Out-of-Band Channel Number (40y) 

This packet uses two characters to specify a 
channel number to which all subsequent out- 
of-band packets refer. This is the CATV chan- 
nel number to which any following out-of-band 
packets belong to. The format is shown in 
Table 8.48. 


Closed Captioning for Europe 

Closed captioning may be also used with 625- 
line videotapes and laserdiscs in Europe, 
present during the blanked active line-time 
portion of lines 22 and 335. 

The data format, amplitudes, and rise and 
fall times are the same as for closed captioning 
in the United States. The timing, as shown in 
Figure 8.58, is slightly different due to the 625- 
line horizontal timing. Older closed captioning 
decoders designed for use only with 525-line 
systems may not work due to these timing dif- 
ferences. 
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D6 | D5 | D4 | D3 | D2 | D1 | DO Character 
1 m5 | m4 | m3 | m2 | mil | m0 start, minute 
1 D h4 | h3 | h2 | hi | hO start, hour 
1 d4 | d3 | d2 |} dl | do start, date 
1 Z T | m3 | m2 | mil | m0 start, month 
1 | m5 | m4} m3 | m2 | mi | m0 | length, minute 
1 h5 | h4 | h3 | h2 | hi | ho length, hour 


Table 8.45. EIA-608 Impulse Capture ID Format. 


D6 | D5 | D4 D3 | D2 | D1 | DO Character 


1 F | N4 |} N3 | N2 | N1 | NO | location 


Table 8.46. EIA-608 Supplemental Data Format. 


D6 | D5 | D4 | D3 | D2 | D1 | DO Character 


1 D | h4 | h3 | h2 | hi | hO | hour 
0 0 0 0 0 0 0 null 


Table 8.47. EIA-608 Local Time Zone Format. 


D6 | D5 | D4 | D3 | D2 | D1 | DO Character 


1 c5 | c4 | c3 | c2 | cl | cO channel low 
1 | cll | cl10} c9 | c8 | c7 | c6 channel high 


Table 8.48. EIA-608 Out-of-Band Channel Number Format. 
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Widescreen Signalling 


To facilitate the handling of various aspect 
ratios of program material received by TVs, a 
widescreen signalling (WSS) system has been 
developed. This standard allows a WSS- 
enhanced 16:9 TV to display programs in their 
correct aspect ratio. 


625-Line Systems 
625-line systems are based on ITU-R BT:1119 
and ETSI EN 300 294. For YPbPr and S-video 
interfaces, WSS is present on the Y signal. For 
analog RGB interfaces, WSS is present on all 
three signals. 

The Analog Copy Generation Management 
System (CGMS-A) is also supported by the 
WSS signal. 


10.5 +0.25 uS 13.0 uS 


7 CYCLES 
OF 0.500 MHZ 
(CLOCK RUN-IN) 


4.43 MHZ 
COLOR BURST 


50 +2 IRE (10 CYCLES) 


BLANK LEVEL | 


43 IRE 


SYNC LeveL —! 


$$ i$ 


10.00 + 0.25 pS 


Data Timing 
The first part of line 23 is used to transmit the 
WSS information, as shown in Figure 8.59. 
The clock frequency is 5 MHz (+100 Hz). 
The signal waveform should be a sine-squared 
pulse, with a half-amplitude duration of 200 +10 
ns. The signal amplitude is 500 mV +5%. 
The NRZ data bits are processed by a bi- 
phase code modulator, such that one data 
period equals 6 elements at 5 MHz. 


Data Content 

The WSS consists of a run-in code, a start 
code, and 14 bits of data, as shown in Table 
8.49. 


TWO 7-BIT + PARITY 
ASCIl CHARACTERS 
(DATA) 


240-288 NS 
RISE / FALL 
TIMES 
(2T BAR 
SHAPING) 


27.5 us 


34.0 uS 


Figure 8.58. 625-Line Lines 22 and 335 Closed Captioning Timing. 
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Run-In 
The run-in consists of 29 elements at 5 MHz of 
a specific sequence, shown in Table 8.49. 


Start Code 

The start code consists of 24 elements at 5 
MHz of a specific sequence, shown in Table 
8.49. 


Group A Data 

The group A data consists of 4 data bits that 
specify the aspect ratio. Each data bit gener- 
ates 6 elements at 5 MHz. Data bit b0 is the 
LSB. 

Table 8.50 lists the data bit assignments 
and usage. The number of active lines listed in 
Table 8.50 are for the exact aspect ratio (a = 
1.33, 1.56, or 1.78). 

The aspect ratio label indicates a range of 
possible aspect ratios (a) and number of active 
lines: 


To allow automatic selection of the display 
mode, a 16:9 receiver should support the fol- 
lowing minimum requirements: 


Case 1: The 4:3 aspect ratio picture should 
be centered on the display, with black bars 
on the left and right sides. 


Case 2: The 14:9 aspect ratio picture 
should be centered on the display, with 
black bars on the left and right sides. Alter- 
nately, the picture may be displayed using 
the full display width by using a small (typ- 
ically 8%) horizontal geometrical error. 


Case 3: The 16:9 aspect ratio picture 
should be displayed using the full width of 
the display. 


Case 4: The >16:9 aspect ratio picture 
should be displayed as in Case 3 or use the 
full height of the display by zooming in. 


Group B Data 
The group B data consists of four data bits that 
specify enhanced services. Each data bit gen- 


4:3 a< 1.46 527-576 
14:9 146<a<1.66 463-526 
16:9 1.66<a<1.90 405-462 
>16:9 a>1.90 < 405 
j 
RUN 
IN 
COLOR 
500 MV +5% BURST 29 


5 MHZ 
ELEMENTS 


BLANK LEVEL | 


43 IRE 


sYNc LeveL 


_ — 


ELEMENTS 


190-210 NS 
RISE / FALL 
TIMES 
(2T BAR 
SHAPING) 


DATA 
(Bo - B13) 


START 
CODE 


= 


24 84 
5 MHZ 5 MHZ 
ELEMENTS 


eee 


11.00 + 0.25 pS 


27.4 us 


Figure 8.59. 625-Line Line 23 WSS Timing. 
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nicl 29 elements 11111 0001 1100 0111 0001 1100 0111 
at 5 MHz (1F1C 71C7y) 
dtalcbde 24 elements 0001 1110 0011 1100 0001 1111 
at 5 MHz (1E 3C1Fy) 
24 elements 
group A at 5 MHz 
(aspect ratio) “0” = 000 111 bO, bl, b2, b3 
“1” = 111 000 
24 elements 
b4, b5, b6, b7 
group B at 5 MHz 7 any ot 
(enhanced services) “0” = 000 111 (b7 = “0” since reserved) 
“1” = 111 000 
18 elements 
group C at 5 MHz 
(subtitles) “(” = 000 111 b8, b9, b10 
“1” = 111 000 
18 elements 
group D at 5 MHz 
(reserved) “0” = 000 111 b11, b12, b13 
“1” = 111 000 
Table 8.49. 625-Line WSS Information. 

Aspect Ratio Position On Active Minimum 
bS, b2, b1, bo Label Format 4:3 Display Lines Requirements 
1000 4:3 full format - 576 case 1 
0001 14:9 letterbox center 504 case 2 
0010 14:9 letterbox top 504 case 2 
1011 16:9 letterbox center 430 case 3 
0100 16:9 letterbox top 430 case 3 
1101 > 16:9 letterbox center - case 4 

1110 14:9 full format center 576 - 
0111 16:9 nana : 576 = 
(anamorphic) 
Table 8.50. 625-Line WSS Group A (Aspect Ratio) Data Bit Assignments and Usage. 
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erates six elements at 5 MHz. Data bit b4 is the 
LSB. Bits b5 and b6 are used for PALplus. 


b4: mode 
0 camera mode 
1 film mode 


b5: color encoding 
0 normal PAL 
1 Motion Adaptive ColorPlus 


b6: helper signals 
0 not present 
1 present 


Group C Data 

The group C data consists of three data bits 
that specify subtitles. Each data bit generates 
six elements at 5 MHz. Data bit b8 is the LSB. 


b8: teletext subtitles 


1 yes 


b10, b9: open subtitles 
00 no 
01 inside active picture 
10 _ outside active picture 
11 reserved 


Group D Data 

The group D data consists of three data bits 
that specify surround sound and copy protec- 
tion. Each data bit generates six elements at 5 
MHz. Data bit b11 is the LSB. 


b11: surround sound 
0 no 
1 yes 


b12: copyright 
0 nocopyright asserted or unknown 
1 copyright asserted 


b13: copy protection 
0 copying not restricted 
1 copying restricted 


525-Line Systems 

EIA-J CPR-1204 and IEC 61880 define a wide- 
screen signalling standard for 525-line sys- 
tems. For YPbPr and S-video interfaces, WSS 
is present on the Y signal. For analog RGB 
interfaces, WSS is present on all three signals. 


Data Timing 
Lines 20 and 283 are used to transmit the WSS 


information, as shown in Figure 8.60. 

The clock frequency is Fs¢/8 or about 
447.443 kHz; Foc is the color subcarrier fre- 
quency of 3.579545 MHz. The signal waveform 
should be a sine-squared pulse, with a half 
amplitude duration of 2.235 us +50 ns. The sig- 
nal amplitude is 70 +10 IRE for a “1,” and 0 +5 
IRE for a “0.” 


Data Content 

The WSS consists of 2 bits of start code, 14 bits 
of data, and 6 bits of CRC, as shown in Table 
8.51. The CRC used is X® + X + 1, all preset to 
as ead 


Start Code 
The start code consists of a “1” data bit fol- 
lowed by a “0” data bit, as shown in Table 8.51. 


Word 0 Data 
Word 0 data consists of 2 data bits: 


00 4:3 aspectratio normal 

01 16:9 aspect ratio anamorphic 
10 4:3aspectratio letterbox 

11 reserved 
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70 +10 IRE 


BLANK LEVEL 1 


40 IRE 


sync LeveL —! 


START | START DATA 
CODE | CODE (BO - B19) 
“44 vg" 


11.20 + 0.30 pS 


— 


49.1 +0.44 pS 


Figure 8.60. 525-Line Lines 20 and 283 WSS Timing. 


start code “1” 

start code “0” 

word 0 b0, b1 

word 1 b2, b3, b4, b5 

word 2 b6, b7, b8, b9, b10, b11, b12, b13 
CRC b14, b15, b16, b17, b18, b19 


Table 8.51. 525-Line WSS Data Bit Assignments and Usage. 


2235 +50 NS 
RISE / FALL 
TIMES 
(2T BAR 
SHAPING) 
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Word 1 Data 
Word 1 data consists of 4 data bits: 


b5, b4, b3, b2: 
0000 copy control information 
1111 default 


Copy control information is transmitted in 
Word 2 data when Word 1 data is “0000.” When 
copy control information is not to be trans- 
ferred, Word 1 data must be set to the default 
value “1111.” 


Word 2 Data 

Word 2 data consists of 14 data bits. When 
Word 1 data is “0000,” Word 2 data consists of 
copy control information. Word 2 copy control 
data must be transferred at the rate of two or 
more frames per two seconds. 

Bits b6 and b7 specify the copy generation 
management system in an analog signal 
(CGMS-A). CGMS-A consists of two bits of dig- 
ital information: 


00 copying permitted 

01 one copy permitted 
10 _—srreserved 

11 no copying permitted 


Bits b8 and b9 specify the operation of the 
the Macrovision copy protection signals added 
to the analog NTSC video signal: 


00 ~——~PSP off 

01 PSP on, 2-line split burst on 
10 PSP on, split burst off 

11 PSP on, 4-line split burst on 


PSP is the Macrovision pseudo-sync pulse 
operation. 


Split burst operation inverts the normal 
phase of the first half of the color burst signal 
on specified scan lines in a normal analog 
video signal. The color burst of four successive 
lines of every 21 lines is modified, beginning at 
lines 24 and 297 (four-line split burst system) 
or of two successive lines of every 17 lines, 
beginning at lines 30 and 301 (two-line split 
burst system). The color burst on all other 
lines is not modified. 

Bit b10 specifies whether the source origi- 
nated from an analog pre-recorded medium. 


b10: 
0 not analog pre-recorded medium 
1 analog pre-recorded medium 


Bits b11, b12, and b13 are reserved and are 
“O00.” 


Teletext 


Teletext allows the transmission of text, graph- 
ics, and data. Data may be transmitted on any 
line, although the VBI interval is most com- 
monly used. The teletext standards are speci- 
fied by ETSI ETS 300 706, ITU-R BT.653 and 
EIA-516. 

For YPbPr and S-video interfaces, teletext 
is present on the Y signal. For analog RGB 
interfaces, teletext is present on all three sig- 
nals. 

There are many systems that use the tele- 
text physical layer to transmit proprietary 
information. The advantage is that teletext has 
already been approved in many countries for 
broadcast, so certification for a new transmis- 
sion technique is not required. 

The data rate for teletext is much higher 
than that used for closed captioning, approach- 
ing up to 7 Mbps in some cases. Therefore, 
ghost cancellation is needed to reliably recover 
the transmitted data. 
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There are seven teletext systems, as System C: 
shown in Table 8.52. EIA-516, also referred to 
as NABTS (North American Broadcast Tele- Brazil United States 
text Specification), is used in the United States, Canada 
and is an expansion of the BT:653 525-line sys- 
tem C standard. System D: 
System A: Japan 
lumbi di Figure 8.61 illustrates the teletext data on a 
Columbia India scan line. If a line normally contains a color 
France burst signal, it will still be present if teletext 
System B: data is present. The 16 bits of clock run-in (or 
clock sync) consists of alternating “1’s” and 
Australia Netherlands “0's.” ; ; 
Belgium New Zealand Figures 8.62 and 8.63 illustrates the struc- 
Ching Norway ture of teletext systems B and C, respectively. 
Denmark Poland 
Egypt Singapore System B Teletext Overview 
Finland South Africa Since teletext System B is the most popular 
Germany Spain teletext format, a basic overview is presented 
Italy Sweden here. 
Jordan Turkey A teletext service typically consists of 
Kuwait United Kingdom pages, with each page corresponding to a 
Malaysia Yugoslavia screen of information. The pages are transmit- 
Morocco ted one at time, and after all pages have been 
Parameter System A System B System C System D 
625-Line Video Systems 
bit rate (Mbps) 6.203125 6.9375 5.734375 5.6427875 
data amplitude 67 IRE 66 IRE 70 IRE 70 IRE 
data per line 40 bytes 45 bytes 36 bytes 37 bytes 
525-Line Video Systems 
bit rate (Mbps) - 5.727272 5.727272 5.727272 
data amplitude - 70 IRE 70 IRE 70 IRE 
data per line - 37 bytes 36 bytes 37 bytes 


Table 8.52. Summary of Teletext Systems and Parameters. 
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transmitted, the cycle repeats, with a typical 
cycle time of about 30 seconds. However, the 
broadcaster may transmit some pages more 
frequently than others, if desired. 

The teletext service is usually based on up 
to eight magazines (allowing up to eight inde- 
pendent teletext services), with each magazine 
containing up to 100 pages. Magazine 1 uses 
page numbers 100-199, magazine 2 uses page 
numbers 200-299, etc. Each page may also 
have sub-pages, used to extend the number of 
pages within a magazine. 

Each page contains 24 rows, with up to 40 
characters per row. A character may be a letter, 
number, symbol, or simple graphic. There are 
also control codes to select colors and other 
attributes such as blinking and double height. 

In addition to teletext information, the tele- 
text protocol may be used to transmit other 
information, such as subtitling, program deliv- 
ery control (PDC), and private data. 


Subtitling 

Subtitling is similar to the closed captioning 
used in the United States. “Open” subtitles are 
the insertion of text directly into the picture 
prior to transmission. “Closed” subtitles are 
transmitted separately from the picture. The 
transmission of closed subtitles in the UK use 
teletext page 888. In the case where multiple 
languages are transmitted using teletext, sepa- 
rate pages are used for each language. 


Program Delivery Control (PDC) 
Program Delivery Control (defined by ETSI 
ETS 300 231 and ITU-R BT:809) is a system 
that controls VCR recording using teletext 
information. The VCR can be programmed to 
look for and record various types of programs 
or a specific program. Programs are recorded 
even if the transmission time changes for any 
reason. 

There are two methods of transmitting 
PDC information via teletext: methods A and 
B. 


BLANK LEVEL 


SYNC LEVEL 


DATA AND ADDRESS 


Figure 8.61. Teletext Line Format. 
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TELETEXT 


APPLICATION LAYER 


PRESENTATION LAYER 


NEXT HEADER PACKET 


LAST PACKET OF PAGE 


PAGE ADDRESS 
(1 BYTE) 


PACKET 26 
PACKET 28 
PACKET 27 PAGE 
HEADER PACKET 
NEXT HEADER PACKET 
PACKET 27 DATA 
GROUP. 


HEADER PACKET 


MAGAZINE / PACKET ADDRESS 


(2 BYTES) ‘ 


DATA BLOCK 


BYTE SYNC 


(1. BYTE) ne 


DATA PACKET 


CLOCK SYNC 


(2 BYTES) ‘ 


DATA UNIT 


Figure 8.62. Teletext System B Structure. 


SESSION LAYER 


TRANSPORT LAYER 


NETWORK LAYER 


LINK LAYER 


PHYSICAL LAYER 


VBI Data 36121 
TELETEXT APPLICATION LAYER 
ITU-T T.101, ANNEX D PRESENTATION LAYER 


RECORD N 


RECORD HEADER 


SESSION LAYER 


RECORD 1 


DATA GROUP N 


DATA GROUP HEADER 


TRANSPORT LAYER 


(8 BYTES) 
DATA GROUP 1 
P HEADER S (1 BYTE) 
(5 BYTES) | 
DATA BLOCK NETWORK LAYER 
BYTE SYNC 
(1 BYTE) 
DATA PACKET LINK LAYER 
CLOCK SYNC 
(2 BYTES) 
DATA UNIT PHYSICAL LAYER 


Figure 8.63. Teletext System C Structure. 
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Method A places the data on a viewable 
teletext page, and is usually transmitted on 
scan line 16. This method is also known as the 
Video Programming System (VPS). 

Method B places the data on a hidden 
packet (packet 26) in the teletext signal. This 
packet 26 data contains the data on each pro- 
gram, including channel, program data, and 
start time. 


Data Broadcasting 

Data broadcasting may be used to transmit 
information to private receivers. Typical appli- 
cations include real-time financial information, 
airport flight schedules for hotels and travel 
agents, passenger information for railroads, 
software upgrades, etc. 


Packets 0-23 

A typical teletext page uses 24 packets, num- 
bered 0-23, that correspond to the 24 rows on 
a displayed page. Packet 24 can add a status 
row at the bottom for user prompting. For each 
packet, three bits specify the magazine 
address (1-8), and five bits specify the row 
address (0-23). The magazine and row 
address bits are Hamming error protected to 
permit single-bit errors to be corrected. 

To save bandwidth, the whole address isn’t 
sent with all packets. Only packet 0 (also called 
the header packet) has all the address informa- 
tion such as row, page, and magazine address 
data. Packets 1-28 contain information that is 
part of the page identified by the most recent 
packet 0 of the same magazine. 

The transmission of a page starts with a 
header packet. Subsequent packets with the 
same magazine address provide additional 
data for that page. These packets may be trans- 
mitted in any order, and interleaved with pack- 
ets from other magazines. A page is 
considered complete when the next header 
packet for that magazine is received. 


The general format for packet 0 is: 


clock run-in 2 bytes 
framing code 1 byte 
magazine androwaddress 2 bytes 
page number 2 bytes 
subcode 4 bytes 
control codes 2 bytes 
display data 32 bytes 


The general format for packets 1-23 is: 


clock run-in 2 bytes 

framing code 1 byte 

magazine androwaddress 2 bytes 

display data 40 bytes 
Packet 24 


This packet defines an additional row for user 
prompting. Teletext decoders may use the data 
in packet 27 to react to prompts in the packet 
24 display row. 


Packet 25 

This packet defines a replacement header line. 
If present, the 40 bytes of data are displayed 
instead of the channel, page, time, and date 
from packet 8.30. 


Packet 26 
Packet 26 consists of: 


clock run-in 2 bytes 
framing code 1 byte 
magazine and row address 2 bytes 
designation code 1 byte 


13 3-byte data groups, each consisting of 
7 data bits 
6 address bits 
5 mode bits 
6 Hamming bits 
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There are 15 variations of packet 26, 
defined by the designation code. Each of the 13 
data groups specify a specific display location 
and data relating to that location. 

This packet is also used to extend the 
addressable range of the basic character set in 
order to support other languages, such as Ara- 
bic, Spanish, Hungarian, Chinese, etc. 

For PDC, packet 26 contains data for each 
program, identifying the channel, program 
date, start time, and the cursor position of the 
program information on the page. When the 
user selects a program, the cursor position is 
linked to the appropriate packet 26 preselec- 
tion data. This data is then used to program the 
VCR. When the program is transmitted, the 
program information is transmitted using 
packet 8.30 format 2. A match between the pre- 
selection data and the packet 8.30 data turns 
the VCR record mode on. 


Packet 27 

Packet 27 tells the teletext decoder how to 
respond to user selections for packet 24. There 
may be up to four packet 27s (packets 27/0 
through 27/3), allowing up to 24 links. It con- 
sists of: 


clock run-in 2 bytes 
framing code 1 byte 

magazine and row address 2 bytes 
designation code 1 byte 

link 1 (red) 6 bytes 
link 2 (green) 6 bytes 
link 3 (yellow) 6 bytes 
link 4 (cyan) 6 bytes 
link 5 (next page) 6 bytes 
link 6 Gndex) 6 bytes 
link control data 1 byte 

page check digit 2 bytes 


Each link consists of: 
7 data bits 
6 address bits 
5 mode bits 
6 hamming bits 


This packet contains information linking 
the current page to six page numbers (links). 
The four colored links correspond to the four 
colored Fastext page request keys on the 
remote. Typically, these four keys correspond 
to four colored menu selections at the bottom 
of the display using packet 24. Selection of one 
of the colored page request keys results in the 
selection of the corresponding linked page. 

The fifth link is used for specifying a page 
the user might want to see after the current 
page, such as the next page in a sequence. 

The sixth link corresponds to the Fastext 
index key on the remote, and specifies the 
page address to go to when the index is 
selected. 


Packets 28 and 29 

These are used to define level 2 and level 3 
pages to support higher resolution graphics, 
additional colors, alternate character sets, etc. 
They are similar in structure to packet 26. 


Packet 8.30 Format 1 

Packet 8.30 (magazine 8, packet 30) isn’t asso- 
ciated with any page, but is sent once per sec- 
ond. This packet is also known as the 
Television Service Data Packet, or TSDP. It 
contains data that notifies the teletext decoder 
about the transmission in general and the time. 


clock run-in 2 bytes 
framing code 1 byte 
magazine androwaddress 2 bytes 
designation code 1 byte 
initial teletext page 6 bytes 
network ID 2 bytes 
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time offset from UTC 1 byte 
date (Modified Julian Day) 3 bytes 
UTC time 3 bytes 
TV program label 4 bytes 
status display 20 bytes 


The Designation Code indicates whether 
the transmission is during the VBI or full-field. 

Initial Teletext Page tells the decoder 
which page should be captured and stored on 
power-up. This is usually an index or menu 
page. 

The Network Identification code identifies 
the transmitting network. 

The TV Program Label indicates the pro- 
gram label for the current program. 

Status Display is used to display a transmis- 
sion status message. 


Packet 8.30 Format 2 

This format is used for PDC recorder control, 
and is transmitted once per second per stream. 
It contains a program label indicating the start 
of each program, usually transmitted about 30 
seconds before the start of the program to 
allow the VCR to detect it and get ready to 
record. 


clock run-in 2 bytes 
framing code 1 byte 
magazine and row address 2bytes 
designation code 1 byte 
initial teletext page 6 bytes 
label channel ID 1 byte 
program control status 1 byte 
country and network ID 2 bytes 
program ID label 5 bytes 
country and network ID 2 bytes 
program type 2 bytes 
status display 20 bytes 


The content is the same as for Format 1, 
except for the 13 bytes of information before 
the status display information. 

Label channel ID (LCI) identifies each of 
up to four PDC streams that may be transmit- 
ted simultaneously. 

The Program Control Status (PCS) indi- 
cates real-time status information, such as the 
type of analog sound transmission. 

The Country and Network ID (CNI) is split 
into two groups. The first part specifies the 
country and the second part specifies the net- 
work. 

Program ID Label (PIL) specifies the 
month, day, and local time of the start of the 
program. 

Program Type (PTY) is a code that indi- 
cates an intended audience or a particular 
series. Examples are “adult,” “children,” 
“music,” “drama,” etc. 


Packet 31 
Packet 31 is used for the transmission of data 
to private receivers. It consists of: 


clock run-in 2 bytes 
framing code 1 byte 
data channel group 1 byte 
message bits 1 byte 
format type 1 byte 
address length 1 byte 
address 0-6 bytes 
repeat indicator 0-1 byte 
continuity indicator 0-1 byte 
data length 0-1 byte 
user data 28-36 bytes 
CRC 2 bytes 
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ATVEF Interactive Content 


ATVEF (Advanced Television Enhancement 
Forum) is a standard for creating and deliver- 
ing enhanced and interactive programs. The 
enhanced content can be delivered over a vari- 
ety of mediums—including analog and digital 
television broadcasts—using terrestrial, cable, 
and satellite networks. 

In defining how to create enhanced con- 
tent, the ATVEF specification defines the mini- 
mum functionality required by ATVEF- 
compliant receivers. To minimize the creation 
of new specifications, the ATVEF uses existing 
Internet technologies such as HTML and Java- 
script. Two additional benefits of doing this are 
that there are already millions of pages of 
potential content, and the ability to use existing 
web-authoring tools. 

The ATVEF 1.0 Content Specification man- 
dates that receivers support, as a minimum, 
HTML 4.0, Javascript 1.1, and Cascading Style 
Sheets. Supporting additional capabilities, 
such as Java and VRML, are optional. This 
ensures content is available to the maximum 
number of viewers. 

For increased capability, a new “tv:” 
attribute is added to the HTML. This attribute 
enables the insertion of the television program 
into the content, and may be used in a HTML 
document anywhere that a regular image may 
be placed. Creating an enhanced content page 
that displays the current television channel 
anywhere on the display is as easy as inserting 
an image in a HTML document. 

The specification also defines how the 
receiver obtains the content and how it is 
informed that enhancements are available. The 
latter task is accomplished with triggers. 


Triggers 

Triggers alert receivers to content enhance- 
ments, and contain information about the 
enhancements. Among other things, triggers 
contain a Universal Resource Locator (URL) 
that defines the location of the enhanced con- 
tent. Content may reside locally—such as 
when delivered over the network and cached 
to a local hard drive—or it may reside on the 
Internet or another network. 

Triggers may also contain a human-read- 
able description of the content. For example, it 
may contain the description “Press ORDER to 
order this product,” which can be displayed for 
the viewer. Triggers also may contain expira- 
tion information, indicating how long the 
enhancement should be offered to the viewer. 

Lastly, triggers may contain scripts that 
trigger the execution of Javascript within the 
associated HTML page, to support synchroni- 
zation of the enhanced content with the video 
signal and updating of dynamic screen data. 


Transports 

Besides defining how content is displayed and 
how the receiver is notified of new content, the 
specification also defines how content is deliv- 
ered. Because a receiver may not have an 
Internet connection, the — specification 
describes two models for delivering content. 
These two models are called transports, and 
the two transports are referred to as Transport 
Type A and Transport Type B. 

If the receiver has a back-channel (or 
return path) to the Internet, Transport Type A 
will broadcast the trigger and the content will 
be pulled over the Internet. 
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If the receiver does not have an Internet 
connection, Transport Type B provides for 
delivery of both triggers and content via the 
broadcast medium. Announcements are sent 
over the network to associate triggers with 
content streams. An announcement describes 
the content, and may include information 
regarding bandwidth, storage requirements, 
and language. 


Delivery Protocols 

For traditional bi-directional Internet commu- 
nication, the Hypertext Transfer Protocol 
(HTTP) defines how data is transferred at the 
application level. For uni-directional broad- 
casts where a two-way connection is not avail- 
able, ATVEF also defines a_ uni-directional 
application-level protocol for data delivery: 
Uni-directional Hypertext Transfer Protocol 
(UHTTP). 

Like HTTP UHTTP uses traditional URL 
naming schemes to reference content. Content 
can reference enhancement pages using the 
standard “http:” and “ftp:” naming schemes. 
However, ATVEF also adds the “lid:,” or local 
identifier URL, naming scheme. This allows 
reference to content that exists locally (such as 
on the receiver's hard drive) as opposed to on 
the Internet or other network. 


Bindings 

How data is delivered over a specific network 
is called binding. The ATVEF has defined bind- 
ings for IP multicast and NTSC. The binding to 
IP is referred to as “reference binding.” 


ATVEF Over NTSC 

Transport Type A triggers are broadcast on 
data channel 2 of the EJA-608 captioning sig- 
nal. 

Transport Type B binding also includes a 
mechanism for delivering IP over the vertical 
blanking interval (VBI), otherwise known as 
IP over VBI (IP/VBI). At the lowest level, the 
television signal transports NABTS (North 
American Basic Teletext Standard) packets 
during the VBI. These NABTS packets are 
recovered to form a sequential data stream 
(encapsulated in a SLIP-like protocol) that is 
unframed to produce IP packets. 


“Raw” VBI Data 


“Raw,” or oversampled, VBI data is simply digi- 
tized VBI data. It is typically oversampled 
using a 2x video sample clock, such as 27 
MHz. Two applications for “raw” VBI data are 
PCs (for software decoding of VBI data) and 
settop boxes (to pass the VBI data on to the 
NTSC/PAL encoder). 

VBI data may be present on any scan line, 
except during the serration and equalization 
intervals. 

One requirement for oversampled VBI 
data is that the “active line time” be a constant, 
independent of the horizontal timing of the 
BLANK# control signal. Thus, all the VBI data 
is assured to be captured regardless of the out- 
put resolution of the active video data. 

A separate control signal, called 
VBIVALID#, may be used to indicate when VBI 
data is present on the digital video interface. 
This simplifies the design of graphics chips, 
NTSC/PAL encoders, and ASICs that are 
required to separate the VBI data from the dig- 
ital video data. 
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“Sliced” VBI Data 


“Sliced,” or binary, VBI data is useful in MPEG 
video systems, such as settop boxes, DVD, dig- 
ital VCRs, and TVs. It may also be used in PC 
applications to reduce PCI bandwidth. 

VBI data may be present on any scan line, 
except during the serration and equalization 
inter vals. 

A separate control signal, called 
VBIVALID#, may be used to indicate when VBI 
data is present on the digital video interface. 
This simplifies the design of graphics chips, 
NTSC/PAL encoders, and ASICs that are 
required to separate the VBI data from the dig- 
ital video data. 


NTSC/PAL Decoder Considerations 

For sliced VBI data capture, hysteresis must 
be used to prevent VBI decoders from rapidly 
turning on and off due to noise and transmis- 
sion errors. In addition, the VBI decoders 
must also compensate for DC offsets, ampli- 
tude variations, ghosting, and timing varia- 
tions. 

For closed captioning, the caption VBI 
decoder monitors the appropriate scan lines 
looking for the clock run-in and start bits used 
by captioning. If found, it locks on to the clock 
run-in, the caption data is sampled, converted 
to binary data, and the 16 bits of data are trans- 
ferred to registers to be output via the host 
processor or video interface. If the clock run-in 
and start bits are not found, it is assumed the 
scan line contains video data, unless other VBI 
data is detected. 

For WSS, the WSS VBI decoder monitors 
the appropriate scan lines looking for the run- 
in and start codes used by WSS. If found, it 
locks on to the run-in code, the WSS data is 
sampled, converted to binary data, and the 14 
or 20 bits of data are transferred to registers to 
be output via the host processor or video inter- 


face. If the clock run-in and start codes are not 
found, it is assumed the scan line contains 
video data, unless other VBI data is detected. 
For teletext, the teletext VBI decoder mon- 
itors each scan line looking for the 16-bit clock 
run-in code used by teletext. If found, it locks 
on to the clock run-in code, the teletext data is 
sampled, converted to binary data, and the 
data is then transferred to registers to be out- 
put via the teletext or video interface. Conven- 
tional host serial interfaces, such as I°C, 
cannot handle the high bit rates of teletext. 
Thus, a 2-pin serial teletext interface is com- 
monly used. If the 16-bit clock run-in code is 
not found, it is assumed the scan line contains 
video data, unless other VBI data is detected. 


Ghost Cancellation 


Ghost cancellation (the removal of undesired 
reflections present in the signal) is required 
due to the high data rate of some services, 
such as teletext. Ghosting greater than 100 ns 
and -12 dB corrupts teletext data. Ghosting 
greater than —3 dB is difficult to remove cost- 
effectively in hardware or software, while 
ghosting less than -12 dB need not be 
removed. Ghost cancellation for VBI data is 
not as complex as ghost cancellation for active 
video. 

Unfortunately, the GCR (ghost cancella- 
tion reference) signal is not commonly used in 
many countries. Thus, a ghost cancellation 
algorithm must determine the amount of 
ghosting using other available signals, such as 
the serration and equalization pulses. 


NTSC Ghost Cancellation 

The NTSC GCR signal is specified in ATSC A/ 
49 and ITU-R BT:1124. If present, it occupies 
lines 19 and 282. The GCR permits the detec- 
tion of ghosting from —3 to +45 us, and follows 
an 8-field sequence. 
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PAL Ghost Cancellation 

The PAL GCR signal is also specified in 
BT.1124 and ETSI ETS 300 732. If present, it 
occupies line 318. The GCR permits the detec- 
tion of ghosting from -3 to +45 us, and follows 
a 4-frame sequence. 


References 


1. 


10. 


11. 


. EIAJ CPR-1204-1, 


Advanced Television Enhancement 
Forum, Enhanced Content Specification, 
1999. 


. ATSC A/49, 13 May 1993, Ghost Cancelling 


Reference Signal for NTSC. 


. BBC Technical Requirements for Digital 


Television Services, Version 1.0, February 
3, 1999, BBC Broadcast. 


. EJA-189-A, July 1976, Encoded Color Bar 


Signal. 


. EJA-516, May 1988, North American Basic 


Teletext Specification (NABTS). 


. EIA-608, September 1994, Recommended 


Practice for Line 21 Data Service. 


. EJA-744-A, December 1998, Transport of 


Content Advisory Information 
Extended Data Service (XDS). 


Using 


. EIA/IS-702, July 1997, Copy Generation 


Management System (Analog). 

1998, Specifications 
and Transfer Method of Video Aspect Ratio 
Identification Signal UID. 

ETSI EN 300 163, Television Systems: 
NICAM 728: Transmission of Two Channel 
Digital Sound with Terrestrial Television 
Systems B, G, H, I, K1, and L, March 1998. 
ETSI EN 300 294, Television Systems: 625- 
line Television Widescreen Signalling 
(WSS), April 1998. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


ETSI ETS 300 231, Television Systems: 
Specification of the Domestic Video Pro- 
gramme Delivery Control System (PDC), 
April 1998. 

ETSI ETS 300 706, Enhanced Teletext Spec- 
ification, May 1997. 

ETSI ETS 300 708, Television Systems: 
Data Transmission within Teletext, March 
1997. 

ETSI ETS 300 731, Television Systems: 
Enhanced 625-Line Phased Alternate Line 
(PAL) Television: PALplus, March 1997. 
ETSI ETS 300 732, Television Systems: 
Enhanced 625-Line PAL/SECAM Televi- 
sion; Ghost Cancellation Reference (GCR) 
Signals, January 1997. 

Faroudja, Yves Charles, NTSC and Beyond, 
IEEE Transactions on Consumer Electron- 
ics, Vol. 34, No. 1, February 1988. 

IEC 61880, 1998-1, Video Systems (525/ 
60)—Video and Accompanied Data Using 
the Vertical Blanking Interval—Analog 
Interface. 

ITU-R BS.707-3, 1998, Transmission of 
Multisound in Terrestrial Television Sys- 
tems PAL B, G, H, and I and SECAM D, K, 
K1, and L. 

ITU-R BT-470-6, 1998, Conventional Televi- 
sion Systems. 

ITU-R BT-471-1, 1986, Nomenclature and 
Description of Colour Bar Signals. 

ITU-R BT472-3, 1990, Video Frequency 
Characteristics of a Television System to Be 
Used for the International Exchange of Pro- 
grammes Between Countries that Have 
Adopted 625-Line Colour or Monochrome 
Systems. 

ITU-R BT-473-5, 1990, Insertion of Test Sig- 
nals in the Field-Blanking Interval of Mono- 
chrome and Colour Television Signals. 


References 369 


24. 


20. 


26. 


27, 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


ITU-R BT.569-2, 1986, Definition of Param- 
eters for Simplified Automatic Measurement 
of Television Insertion Test Signals. 

ITU-R BT.653-3, 1998, Teletext Systems. 
ITU-R BT:809, 1992, Programme Delivery 
Control (PDC) System for Video Recording. 
ITU-R BT.1118, 1994, Enhanced Compatible 
Widescreen Television Based on Conven- 
tional Television Systems. 

ITU-R BT:1119-2, 1998, Wide-Screen Sig- 
nalling for Broadcasting. 

ITU-R BT.1124, 1994, Reference Signals for 
Ghost Cancelling in Analogue Television 
Systems. 

ITU-R BT:1197-1, 1998, Enhanced Wide- 
Screen PAL TV Transmission System (the 
PALplus System). 

ITU-R BT1298, 1997, Enhanced Wide- 
Screen NTSC TV Transmission System. 
Multichannel Television Sound, BTSC Sys- 
tem Recommended Practices, EIA Televi- 
sion Systems Bulletin No. 5, July 1985, 
Electronic Industries Association. 

NTSC Video Measurements, Tektronix, 
Inc., 1997. 

SMPTE 12M-1999, Television, Audio and 
Film—Time and Control Code. 


39. 


36. 


37, 


38. 


39. 


40. 


41. 


42. 


43. 


SMPTE 170M-1999, Television—Compos- 
ite Analog Video Signal—NTSC for Studio 
Applications. 

SMPTE 262M-1995, Television, Audio and 
Film—Binary Groups of Time and Control 
Codes—Storage and Transmission of Data. 
SMPTE 309M-1999, Televiston—Trans- 
mission of Date and Time Zone Information 
in Binary Groups of Time and Control 
Code. 

SMPTE RP164-1996, Location of Vertical 
Interval Time Code. 

SMPTE RP186-1995, Video Index Informa- 
tion Coding for 525- and 625-Line Televi- 
sion Systems. 

SMPTE RP201-1999, Encoding Film 
Transfer Information Using Vertical Inter- 
val Time Code. 

Specification of Television Standards for 
625-Line System-I Transmissions, 1971, 
Independent Television Authority (ITA) 
and British Broadcasting Corporation 
(BBC). 

Television Measurements, NTSC Systems, 
Tektronix, Inc., 1998. 
Television Measurements, 
Tektronix, Inc., 1990. 


PAL Systems, 


| chapter 9 | 9 


NTSC and PAL Digital 
Encoding and Decoding 


Although not exactly “digital” video, the NTSC 
and PAL composite color video formats are 
currently the most common formats for video. 
Although the video signals themselves are ana- 
log, they can be encoded and decoded almost 
entirely digitally. 

Analog NTSC and PAL encoders and 
decoders have been available for some time. 
However, they have been difficult to use, 
required adjustment, and offered limited video 
quality. Using digital techniques to implement 
NTSC and PAL encoding and decoding offers 
many advantages, such as ease of use, mini- 
mum analog adjustments, and excellent video 
quality. 

In addition to composite video, S-video is 
supported by consumer and pro-video equip- 
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ment, and should also be implemented. S-video 
uses separate luminance (Y) and chrominance 
(C) analog video signals so higher quality may 
be maintained by eliminating the Y/C separa- 
tion process. 

This chapter discusses the design of a digi- 
tal encoder (Figure 9.1) and decoder (Figure 
9.21) that support composite and S-video (M) 
NTSC and (B, D, G, H, I, Nc) PAL video sig- 
nals. (M) and (N) PAL are easily accommo- 
dated with some slight modifications. 

NTSC encoders and decoders are usually 
based on the YCbCr, YUV, or YIQ color space. 
PAL encoders and decoders are usually based 
on the YCbCr or YUV color space. 
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Field 
: Sample Clock a Active Total 
Video Standard Rate Applications Resolution Resolution Rate 
(per second) 
9 MHz SVCD 480 x 480 972 X 525 
BT.601 720! x 480 
(M) NISC, 59.94 
(M) PAL 13.5 MHz MPEG 2 704 x 480 858 x 525 interlaced 
DV 720 x 480 
12.27 MHz square pixels 640 x 480 780 x 525 
9 MHz SVCD 480 x 576 576 x 625 
14.75 MHz square pixels 768 x 576 944 x 625 
(B, D, G, H, I, N, Nc) 2 50 
PAL Bre ae interlaced 
13.5 MHz MPEG 2 704 x 576 864 x 625 
DV 720 x 576 


Table 9.1. Common NTSC/PAL Sample Rates and Resolutions. Typically 716 true active 
samples between 10% blanking points. “Typically 702 true active samples between 50% 


blanking points. 


NTSC and PAL Encoding 


YCbCr input data has a nominal range of 16- 
235 for Y and 16-240 for Cb and Cr. RGB input 
data has a range of 0-255; pro-video applica- 
tions may use a nominal range of 16-235. 

As YCbCr values outside these ranges 
result in overflowing the standard YIQ or YUV 
ranges for some color combinations, one of 
three things may be done, in order of prefer- 
ence: (a) allow the video signal to be generated 
using the extended YIQ or YUV ranges; (b) 
limit the color saturation to ensure a legal 
video signal is generated; or (c) clip the YIQ or 
YUV levels to the valid ranges. 

4:1:1, 4:2:0, or 4:2:2 YCbCr data must be 
converted to 4:4:4 YCbCr data before being 
converted to YIQ or YUV data. The chromi- 
nance lowpass filters will not perform the inter- 
polation properly. 

Table 9.1 lists some of the common sample 
rates and resolutions. 


2x Oversampling 


2x oversampling generates 8:8:8 YCbCr or 
RGB data, simplifying the analog output filters. 
The oversampler is also a convenient place to 
convert from 8-bit to 10-bit data, providing an 
increase in video quality. 


Color Space Conversion 


Choosing the 10-bit video levels to be white = 
800 and sync = 16, and knowing that the sync- 
to-white amplitude is 1V, the full-scale output of 
the D/A converters (DACs) is therefore set to 
1.305V. 


(M) NTSC, (M, N) PAL 

Since (M) NTSC and (M, N) PAL have a 7.5 
IRE blanking pedestal and a 40 IRE sync ampli- 
tude, the color space conversion equations are 
derived so as to generate 0.660V of active 
video. 
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Figure 9.1. Typical NTSC/PAL Digital Encoder Implementation. 
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YUV Color Space Processing 
Modern encoder designs are now based on the 
YUV color space, For these encoders, the 
YCbCr to YUV equations are: 


Y = 0.591 (Y 601 — 64) 
U = 0.504 (Cb — 512) 
V = 0.711(Cr - 512) 
The R’G’B’ to YUV equations are: 


Y = 0.151R’ + 0.297G’ + 0.058B’ 
U =-0.074R’ - 0.147G’ + 0.221B’ 
V = 0.312R’ - 0.261G’ — 0.051B’ 


For pro-video applications using a 10-bit nomi- 
nal range of 64-940 for RGB, the R’G’B’ to 
YUV equations are: 


Y = 0.177(R’ — 64) + 0.347(G’ - 64) + 


0.067 (B’ - 64) 

U =-0.087 (R’ - 64) - 0.171(G’ - 64) + 
0.258 (B’ — 64) 

V = 0.364(R’ - 64) - 0.305(G’ - 64) - 
0.059(B’ — 64) 


Y has a nominal range of 0 to 518, U anom- 
inal range of 0 to +226, and V a nominal range 
of 0 to +319. Negative values of Y should be 
supported to allow test signals, keying infor- 
mation, and real-world video to be passed 
through the encoder with minimum corrup- 
tion. 


YIQ Color Space Processing 

For older NTSC encoder designs based on the 
YIQ color space, the YCbCr to YIQ equations 
are: 


Y = 0.591 (Y¢o1 - 64) 

I = 0.596 (Cr — 512) - 0.274(Cb - 512) 

Q = 0.387(Cr — 512) + 0.423 (Cb — 512) 
The R’G’B’ to YIQ equations are: 


Y = 0.151R’ + 0.297G’ + 0.058B’ 
I = 0.302R’ — 0.139G’ — 0.163B’ 
Q = 0.107R’ — 0.265G’ + 0.158B’ 


For pro-video applications using a 10-bit nomi- 
nal range of 64-940 for R’G’B’, the R’G’B’ to 
YIQ equations are: 


Y = 0.177(R’ — 64) + 0.347(G’ - 64) + 


0.067 (B’ — 64) 

I = 0.352(R’ - 64) - 0.162(G’ - 64) - 
0.190(B’ — 64) 

Q = 0.125(R’ - 64) - 0.309(G’ - 64) + 
0.184(B’ - 64) 


Y has a nominal range of 0 to 518, I anomi- 
nal range of 0 to +309, and Q a nominal range 
of 0 to +271. Negative values of Y should be 
supported to allow test signals, keying infor- 
mation, and real-world video to be passed 
through the encoder with minimum corrup- 
tion. 


YCbCr Color Space Processing 

If the design is based on the YUV color space, 
the Cb and Cr conversion to U and V may be 
avoided by scaling the sin and cos values dur- 
ing the modulation process or scaling the color 
difference lowpass filter coefficients. This has 
the advantage of reducing data path process- 
ing. 
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NTSC-J 

Since the version of (M) NTSC used in Japan 
has a 0 IRE blanking pedestal, the color space 
conversion equations are derived so as to gen- 
erate 0.714V of active video. 


YUV Color Space Processing 
The YCbCr to YUV equations are: 


Y = 0.639 (Y 601 - 64) 
U = 0.545(Cb — 512) 
V = 0.769(Cr - 512) 
The R’G’B’ to YUV equations are: 


Y = 0.164R’ + 0.321G’ + 0.062B’ 
U =-0.080R’ - 0.159G’ + 0.239B’ 
V = 0.337R’ — 0.282G’ — 0.055B’ 


For pro-video applications using a 10-bit nomi- 
nal range of 64-940 for R’G’B’, the R’G’B’ to 
YUV equations are: 


Y = 0.191(R’ - 64) + 0.375(G’ - 64) + 


0.073 (B’ — 64) 

U =-0.094(R’ - 64) - 0.185(G’ - 64) + 
0.279(B’ — 64) 

V = 0.393(R’- 64) - 0.329(G’ - 64) - 
0.064 (B’ — 64) 


Y has a nominal range of 0 to 560, U a nom- 
inal range of 0 to +244, and V a nominal range 
of 0 to +344. Negative values of Y should be 
supported to allow test signals, keying infor- 
mation, and real-world video to be passed 
through the encoder with minimum corrup- 
tion. 


YIQ Color Space Processing 
For older encoder designs based on the YIQ 
color space, the YCbCr to YIQ equations are: 


Y = 0.639 (Y¢o1 - 64) 

I = 0.645(Cr — 512) - 0.297(Cb - 512) 

Q = 0.419(Cr — 512) + 0.457 (Cb — 512) 
The R’G’B’ to YIQ equations are: 


Y = 0.164R’ + 0.321G’ + 0.062B’ 
I = 0.326R’ - 0.150G’ - 0.176B’ 
Q = 0.116R’ — 0.286G’ + 0.170B’ 


For pro-video applications using a 10-bit nomi- 
nal range of 64-940 for R’G’B’, the R’G’B’ to 
YIQ equations are: 


Y = 0.191(R’ - 64) + 0.375(G’ - 64) + 


0.073 (B’ - 64) 

I = 0.381(R’ - 64) - 0.176(G’ - 64) - 
0.205 (B’ - 64) 

Q = 0.135(R’ — 64) - 0.334(G’ - 64) + 
0.199(B’ — 64) 


Y has a nominal range of 0 to 560, I anomi- 
nal range of 0 to +334, and Q a nominal range 
of 0 to +293. Negative values of Y should be 
supported to allow test signals, keying infor- 
mation, and real-world video to be passed 
through the encoder with minimum corrup- 
tion. 


YCbCr Color Space Processing 

If the design is based on the YUV color space, 
the Cb and Cr conversion to U and V may be 
avoided by scaling the sin and cos values dur- 
ing the modulation process or scaling the color 
difference lowpass filter coefficients. This has 
the advantage of reducing data path process- 
ing. 
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(B, D, G, H, I, Nc) PAL 

Since these PAL standards have a 0 IRE blank- 
ing pedestal and a 43 IRE sync amplitude, the 
color space conversion equations are derived 
so as to generate 0.7V of active video. 


YUV Color Space Processing 
The YCbCr to YUV equations are: 


Y = 0.625 (Y 601 - 64) 
U = 0.533 (Cb - 512) 
V = 0.752(Cr — 512) 
The R’G’B’ to YUV equations are: 


Y = 0.160R’ + 0.314G’ + 0.061B’ 
U =-0.079R’ -— 0.155G’ + 0.234B’ 
V = 0.329R’ — 0.275G’ — 0.054B’ 


For pro-video applications using a 10-bit nomi- 
nal range of 64-940 for R’G’B’, the R’G’B’ to 
YUV equations are: 


Y = 0.187(R’ — 64) + 0.367(G’ - 64) + 


0.071(B’ - 64) 

U =-0.092 (R’ - 64) — 0.181(G’ — 64) + 
0.273 (B’ - 64) 

V = 0.385(R’- 64) - 0.322(G’ - 64) - 
0.063 (B’ - 64) 


Y has a nominal range of 0 to 548, U anom- 
inal range of 0 to +239, and V a nominal range 
of 0 to +337. Negative values of Y should be 
supported to allow test signals, keying infor- 
mation, and real-world video to be passed 
through the encoder with minimum corrup- 
tion. 


YCbCr Color Space Processing 
If the design is based on the YUV color space, 
the Cb and Cr conversion to U and V may be 


avoided by scaling the sin and cos values dur- 
ing the modulation process or scaling the color 
difference lowpass filter coefficients. This has 
the advantage of reducing data path process- 
ing. 


Luminance (Y) Processing 


Lowpass filtering to about 6 MHz must be 
done to remove high-frequency components 
generated as a result of the 2x oversampling 
process. 

An optional notch filter may also be used to 
remove the color subcarrier frequency from 
the luminance information. This improves 
decoded video quality for decoders that use 
simple Y/C separation. The notch filter should 
be disabled when generating S-video, RGB, or 
YPbPr video signals. 

Next, any blanking pedestal is added dur- 
ing active video, and the blanking and sync 
information are added. 


(M) NTSC, (M, N) PAL 

As (M) NTSC and (M, N) PAL have a 7.5 IRE 
blanking pedestal, a value of 42 is added to the 
luminance data during active video. 0 is added 
during the blank time. 

After the blanking pedestal is added, the 
luminance data is clamped by a blanking signal 
that has a raised cosine distribution to slow the 
slew rate of the start and end of the video sig- 
nal. Typical blank rise and fall times are 140 
+20 ns for NTSC and 300 +100 ns for PAL. 

Digital composite sync information is 
added to the luminance data after the blank 
processing has been performed. Values of 16 
(syne present) or 240 (no sync) are assigned. 
The sync rise and fall times should be pro- 
cessed to generate a raised cosine distribution 
(between 16 and 240) to slow the slew rate of 
the sync signal. Typical sync rise and fall times 
are 140 +20 ns for NTSC and 250 +50 ns for 
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PAL, although the encoder should generate 
sync edges of about 130 or 240 ns to compen- 
sate for the analog output filters slowing the 
sync edges. 

At this point, we have digital luminance 
with sync and blanking information, as shown 
in Table 9.2. 


NTSC-J 

When generating NTSC-J video, there is a 0 
IRE blanking pedestal. Thus, no blanking ped- 
estal is added to the luminance data during 


active video. Otherwise, the processing is the 
same as for (M) NTSC. 


(B, D, G, H, I, Nc) PAL 

When generating (B, D, G, H, I, No) PAL 
video, there is a 0 IRE blanking pedestal. Thus, 
no blanking pedestal is added to the luminance 
data during active video. 

Blanking information is done using the 
same technique as used for (M) NTSC. How- 
ever, typical blank rise and fall times are 300 
+100 ns. 

Composite sync information is added 
using the same technique as used for (M) 
NTSC, except values of 16 (sync present) or 
252 (no sync) are used. Typical sync rise and 
fall times are 250 +50 ns, although the encoder 
should generate sync edges of about 240 ns to 


compensate for the analog output filters slow- 
ing the sync edges. 

At this point, we have digital luminance 
with sync and blanking information, as shown 
in Table 9.2. 


Analog Luminance (Y) Generation 

The digital luminance data may drive a 10-bit 
DAC that generates a 0-1.305V output to gen- 
erate the Y video signal of a S-video (Y/C) 
interface. 

Figures 9.2 and 9.3 show the luminance 
video waveforms for 75% color bars. The num- 
bers on the luminance levels indicate the data 
value for a 10-bit DAC with a full-scale output 
value of 1.305V. The video signal at the connec- 
tor should have a source impedance of 75Q.. 

As the sample-and-hold action of the DAC 
introduces a (sin x) /x characteristic, the video 
data may be digitally filtered by a [(sin x) /x]7! 
filter to compensate. Alternately, as an analog 
lowpass filter is usually present after the DAC, 
the correction may take place in the analog fil- 
ter. 

As an option, the ability to delay the digital 
Y information a programmable number of 
clock cycles before driving the DAC may be 
useful. If the analog luminance video is low- 
pass filtered after the DAC, and the analog 
chrominance video is bandpass filtered after its 


weer (M) NTSC NTSC-J (B, D, rr 1, Nc) (M, N) PAL 
white 800 800 800 800 
black 282 240 252 282 
blank 240 240 252 240 
syne 16 16 16 16 


Table 9.2. 10-Bit Digital Luminance Values. 
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Figure 9.2. (M) NTSC Luminance (Y) Video Signal for 75% Color Bars. Indicated luminance levels 
are 10-bit values. 
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1.020 V | WHITE LEVEL (800) 
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0.321 V | BLACK / BLANK LEVEL (252) 
43 IRE 

0.020 V ' SYNC LEVEL (16) 


Figure 9.3. (B, D, G, H, 1) PAL Luminance (Y) Video Signal for 75% Color Bars. Indicated luminance 
levels are 10-bit values. 


378 


Chapter 9: NTSC/PAL Digital Encoding and Decoding 


DAC, the chrominance video path may have a 
longer delay (typically up to about 400 ns) than 
the luminance video path. By adjusting the 
delay of the Y data, the analog luminance and 
chrominance video after filtering will be 
aligned more closely, simplifying the analog 
design. 


Color Difference Processing 


Lowpass Filtering 

The color difference signals (CbCr, UV, or IQ) 
should be lowpass filtered using a Gaussian fil- 
ter. This filter type minimizes ringing and over- 
shoot, avoiding the generation of visual 
artifacts on sharp edges. 

If the encoder is used in a video editing 
application, the filters should have a maximum 
ripple of +0.1 dB in the passband. This mini- 
mizes the cumulation of gain and loss artifacts 
due to the filters, especially when multiple 
passes through the encoding and decoding 
processes are done. At the final encoding 
point, Gaussian filters may be used. 


YCbCr and YUV Color Space 
Cb and Cr, or U and V, are lowpass filtered to 
about 1.3 MHz. Typical filter characteristics 
are <2 dB attenuation at 1.3 MHz and >20 dB 
attenuation at 3.6 MHz. The filter characteris- 
tics are shown in Figure 9.4. 


YIQ Color Space 

Q is lowpass filtered to about 0.6 MHz. Typical 
filter characteristics are <2 dB attenuation at 
0.4 MHz, <6 dB attenuation at 0.5 MHz, and >6 
dB attenuation at 0.6 MHz. The filter character- 
istics are shown in Figure 9.5. 


Typical filter characteristics for I are the 
same as for U and V. 


Filter Considerations 

The modulation process is shown in spectral 
terms in Figures 9.6 through 9.9. The fre- 
quency spectra of the modulation process are 
the same as those if the modulation process 
were analog, but are repeated at harmonics of 
the sample rate. 

Using wide-band (1.3 MHz) filters, the 
modulated chrominance spectra overlap near 
the zero frequency regions, resulting in alias- 
ing. Also, there may be considerable aliasing 
just above the subcarrier frequency. For these 
reasons, the use of narrower-band lowpass fil- 
ters (0.6 MHz) may be more appropriate. 

Wide-band Gaussian filters ensure opti- 
mum compatibility with monochrome displays 
by minimizing the artifacts at the edges of col- 
ored objects. A narrower, sharper-cut lowpass 
filter would emphasize the subcarrier signal at 
these edges, resulting in ringing. If mono- 
chrome compatibility can be ignored, a benefi- 
cial effect of narrower filters would be to 
reduce the spread of the chrominance into the 
low-frequency luminance (resulting in low-fre- 
quency cross-luminance), which is difficult to 
suppress in a decoder. 

Also, although the encoder may maintain a 
wide chrominance bandwidth, the bandwidth 
of the color difference signals in a decoder is 
usually much narrower. In the decoder, loss of 
the chrominance upper sidebands (due to low- 
pass filtering the video signal to 4.2-5.5 MHz) 
contributes to ringing and color difference 
crosstalk on color transitions. Any increase in 
the decoder chrominance bandwidth causes a 
proportionate increase in cross-color. 
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Figure 9.4. Typical 1.3-MHz Lowpass Digital Figure 9.5. Typical 0.6-MHz Lowpass Digital 


Filter Characteristics. Filter Characteristics. 
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Figure 9.6. Frequency Spectra for NTSC Digital Chrominance Modulation (Fs = 13.5 MHz, Fsc¢ = 
3.58 MHz). (a) Lowpass filtered U and V signals. (b) Color subcarrier. (c) Modulated chrominance 
spectrum produced by convolving (a) and (b). 
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Figure 9.7. Frequency Spectra for NTSC Digital Chrominance Modulation (Fs = 12.27 MHz, Fsc = 
3.58 MHz). (a) Lowpass filtered U and V signals. (b) Color subcarrier. (c) Modulated chrominance 
spectrum produced by convolving (a) and (b). 
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Figure 9.8. Frequency Spectra for PAL Digital Chrominance Modulation (F, = 13.5 MHz, Fs¢ = 


4.43 MHz). (a) Lowpass filtered U and V signals. (b) Color subcarrier. (c) Modulated chrominance 


spectrum produced by convolving (a) and (b). 
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Figure 9.9. Frequency Spectra for PAL Digital Chrominance Modulation (Fs = 14.75 MHz, Fsc = 


4.43 MHz). (a) Lowpass filtered U and V signals. (b) Color subcarrier. (c) Modulated chrominance 


spectrum produced by convolving (a) and (b). 
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Chrominance (C) Modulation 


(M) NTSC, NTSC-J 

During active video, the CbCr, UV, or IQ data 
modulate sin and cos subcarriers, as shown in 
Figure 9.1, resulting in digital chrominance 
(C) data. For this design, the 11-bit reference 
subcarrier phase (see Figure 9.17) and the 
burst phase are the same (180°). 

For YUV and YCbCr processing, 180° 
must be added to the 11-bit reference subcar- 
rier phase during active video time so the out- 
put of the sin and cos ROMs have the proper 
subcarrier phases (0° and 90°, respectively). 

For YIQ processing, 213° must be added to 
the 11-bit reference subcarrier phase during 
active video time so the output of the sin and 
cos ROMs have the proper subcarrier phases 
(33° and 123°, respectively). 

For the following equations, 


@ = 2nF sc 
Foc = 3.579545 MHz (10 Hz) 


YUV Color Space 
As discussed in Chapter 8, the chrominance 
signal may be represented by: 


(U sin wt) + (V cos wt) 
Chrominance amplitudes are +sqrt(U? + V) 
YCbCr Color Space 
If the encoder is based on the YCbCr color 


space, the chrominance signal may be repre- 
sented by: 


(Cb — 512) (0.504) (sin wt) + 
(Cr — 512) (0.711) (cos wt) 


For NTSC-J systems, the equations are: 


(Cb — 512) (0.545) (sin wt) + 
(Cr — 512) (0.769) (cos ot) 


In these cases, the values in the sin and 
cos ROMs are scaled by the indicated values to 
allow the modulator multipliers to accept Cb 
and Cr data directly, instead of U and V data. 


YIQ Color Space 
As discussed in Chapter 8, the chrominance 
signal may also be represented by: 


(Q sin (wt + 33°)) + cos (at + 33°)) 


Chrominance amplitudes are +sqrt(I? + Q?) 


(B, D, G, H, I, M, N, Nc) PAL 
During active video, the CbCr or UV data mod- 
ulate sin and cos subcarriers, as shown in Fig- 
ure 9.1, resulting in digital chrominance (C) 
data. For this design, the 11-bit reference sub- 
carrier phase (see Figure 9.17) is 135°. 

For the following equations, 


@= 2nF sc 


Fg¢ = 4.43361875 MHz (+5 Hz) 
for (B, D, G, H, I, N) PAL 


Foc = 3.58205625 MHz (+5 Hz) 
for (Nc) PAL 


Foc = 3.57561149 MHz (+5 Hz) 
for (M) PAL 


PAL Switch 

In theory, since the [sin wt] and [cos wt] sub- 
carriers are orthogonal, the U and V signals 
can be perfectly separated from each other in 
the decoder. However, if the video signal is 
subjected to distortion, such as asymmetrical 
attenuation of the sidebands due to lowpass fil- 
tering, the orthogonality is degraded, resulting 
in crosstalk between the U and V signals. 
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PAL uses alternate line switching of the V 
signal to provide a frequency offset between 
the U and V subcarriers, in addition to the 90° 
subcarrier phase offset. When decoded, 
crosstalk components appear modulated onto 
the alternate line carrier frequency, in solid 
color areas producing a moving pattern known 
as Hanover bars. This pattern may be sup- 
pressed in the decoder by a comb filter that 
averages equal contributions from switched 
and unswitched lines. 

When PAL Switch = zero, the 11-bit refer- 
ence subcarrier phase (see Figure 9.17) and 
the burst phase are the same (135°). Thus, 
225° must be added to the 11-bit reference sub- 
carrier phase during active video so the output 
of the sin and cos ROMs have the proper sub- 
carrier phases (0° and 90°, respectively). 

When PAL Switch = one, 90° is added to 
the 11-bit reference subcarrier phase, resulting 
in a 225° burst phase. Thus, an additional 135° 
must be added to the 11-bit reference subcar- 
rier phase during active video so the output of 
the sin and cos ROMs have the proper phases 
(0° and 90°, respectively). 

Note that in Figure 9.17, while PAL Switch 
= one, the —-V subcarrier is generated, imple- 
menting the -V component. 


YUV Color Space 
As discussed in Chapter 8, the chrominance 
signal is represented by: 


(U sin wt) + (V cos at) 


with the sign of V alternating from one line to 
the next (known as the PAL Switch). 

Chrominance amplitudes are +sqrt(U? + 
Vo: 


YCbCr Color Space 

If the encoder is based on the YCbCr color 
space, the chrominance signal for (B, D, G, H, 
I, Nc) PAL may be represented by: 


(Cb — 512) (0.533) (sin wt) + 
(Cr — 512) (0.752) (cos ot) 


The chrominance signal for (M, N) PAL may 
be represented by: 


(Cb — 512) (0.504) (sin wt) + 
(Cr — 512) (0.711) (cos ot) 


In these cases, the values in the sin and 
cos ROMs are scaled by the indicated values to 
allow the modulator multipliers to accept Cb 
and Cr data directly, instead of U and V data. 


General Processing 

The subcarrier sin and cos values should have 
a minimum of 9 bits plus sign of accuracy. The 
modulation multipliers must have saturation 
logic on the outputs to ensure overflow and 
underflow conditions are saturated to the max- 
imum and minimum values, respectively. 

After the modulated color difference sig- 
nals are added together, the result is rounded 
to 9 bits plus sign. At this point, the digital 
modulated chrominance has the ranges shown 
in Table 9.3. The resulting digital chrominance 
data is clamped by a blanking signal that has 
the same raised cosine values and timing as 
the one used to blank the luminance data. 
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Burst Generation 

As shown in Figure 9.1, the lowpass filtered 
color difference data are multiplexed with the 
color burst envelope information. During the 
color burst time, the color difference data 
should be ignored and the burst envelope sig- 
nal inserted on the Cb, U, or Q channel (the 
Cr, V, or I channel is forced to zero). 

The burst envelope rise and fall times 
should generate a raised cosine distribution to 
slow the slew rate of the burst envelope. Typi- 
cal burst envelope rise and fall times are 300 
+100 ns. 

The burst envelope should be wide enough 
to generate nine or ten cycles of burst informa- 
tion with an amplitude of 50% or greater. When 
the burst envelope signal is multiplied by the 
output of the sin ROM, the color burst is gener- 
ated and will have the range shown in Table 
9.3. 

For pro-video applications, the phase of the 
color burst should be programmable over a 0° 
to 360° range to provide optional system phase 
matching with external video signals. This can 
be done by adding a programmable value to 
the 11-bit subcarrier reference phase during 
the burst time (see Figure 9.17). 


Analog Chrominance (C) Generation 
The digital chrominance data may drive a 10- 
bit DAC that generates a 0-1.305V output to 
generate the C video signal of an S-video (Y/C) 
interface. The video signal at the connector 
should have a source impedance of 75Q.. 

Figures 9.10 and 9.11 show the modulated 
chrominance video waveforms for 75% color 
bars. The numbers in parentheses indicate the 
data value for a 10-bit DAC with a full-scale out- 
put value of 1.305V. If the DAC can’t handle the 
generation of bipolar video signals, an offset 
must be added to the chrominance data (and 
the sign information dropped) before driving 
the DAC. In this instance, an offset of +512 was 
used, positioning the blanking level at the mid- 
point of the 10-bit DAC output level. 

As the sample-and-hold action of the DAC 
introduces a (sin x) /x characteristic, the video 
data may be digitally filtered by a [(sin x)/x]7! 
filter to compensate. Alternately, as an analog 
lowpass filter is usually present after the DAC, 
the correction may take place in the analog fil- 
ter. 


ee 
peak chroma 328 354 347 328 
peak burst 112 112 117 117 
blank 0 0 0 0 
peak burst -112 -112 -117 -117 
peak chroma -328 -354 -347 -328 


Table 9.3. 10-Bit Digital Chrominance Values. 
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Figure 9.10. (M) NTSC Chrominance (C) Video Signal for 75% Color Bars. Indicated video levels are 
10-bit values. 
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Figure 9.11. (B, D, G, H, 1) PAL Chrominance (C) Video Signal for 75% Color Bars. Indicated video 
levels are 10-bit values. 
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Figure 9.12. (M) NTSC Composite Video Signal for 75% Color Bars. Indicated video levels are 10- 


bit values. 


Analog Composite Video 


The digital luminance (Y) data and the digital 
chrominance (C) data are added together, gen- 
erating digital composite color video with the 
levels shown in Table 9.4. 

The result may drive a 10-bit DAC that 
generates a 0-1.305V output to generate the 
composite video signal. The video signal at the 
connector should have a source impedance of 
75Q.. 


Figures 9.12 and 9.13 show the video wave- 
forms for 75% color bars. The numbers in 
parentheses indicate the data value for a 10-bit 
DAC with a full-scale output value of 1.305V. 

As the sample-and-hold action of the DAC 
introduces a (sin x) /x characteristic, the video 
data may be digitally filtered by a [(sin x) /x]7! 
filter to compensate. Alternately, as an analog 
lowpass filter is usually present after the DAC, 
the correction may take place in the analog fil- 
ter. 
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Figure 9.13. (B, D, G, H, I) PAL Composite Video Signal for 75% Color Bars. Indicated video levels 
are 10-bit values. 


iste NTSC ntsc | | pa 
peak chroma 973 987 983 973 
white 800 800 800 800 
peak burst 352 352 369 357 
black 282 240 252 282 
blank 240 240 252 240 
peak burst 128 128 135 123 
peak chroma 109 53 69 109 
sync 16 16 16 16 


Table 9.4. 10-Bit Digital Composite Video Levels. 
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Black Burst Video Signal 

As an option, the encoder can generate a black 
burst (or house sync) video signal that can be 
used to synchronize multiple video sources. 
Figures 9.14 and 9.15 illustrate the black burst 


0.449 V 


0.357 V 


0.306 V 


0.163 V 


0.020 V 


3.58 MHZ 
COLOR BURST 
(9 CYCLES) 


video signals. Note that these are the same as 
analog composite, but do not contain any active 
video information. The numbers in parenthe- 
ses indicate the data value for a 10-bit DAC 
with a full-scale output value of 1.305V. 


20 IRE 
BLACK LEVEL (282) 
175 IRE | 


| 20 IRE 


40 IRE 


BLANK LEVEL (240) 


SYNC LEVEL (16) 


Figure 9.14. (M) NTSC Black Burst Video Signal. Indicated video levels are 10-bit values. 
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Figure 9.15. 
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(B, D, G, H, 1) PAL Black Burst Video Signal 


BLACK / BLANK LEVEL (252) 
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. Indicated video levels are 10-bit values. 
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Color Subcarrier Generation 


The color subcarrier can be generated from 
the sample clock using a discrete time oscilla- 
tor (DTO). 

When generating video that may be used 
for editing, it is important to maintain the 
phase relationship between the color subcar- 
rier and sync information. Unless the subcar- 
rier phase relative to the sync phase is 
properly maintained, an edit may result in a 
momentary color shift. PAL also requires the 
addition of a PAL Switch, which is used to 
invert the polarity of the V data every other 
scan line. Note that the polarity of the PAL 
Switch should be maintained through the 
encoding and decoding process. 

Since in this design the color subcarrier is 
derived from the sample clock, any jitter in the 
sample clock will result in a corresponding 
subcarrier frequency jitter. In some PCs, the 
sample clock is generated using a phase-lock 
loop (PLL), which may not have the necessary 
clock stability to keep the subcarrier phase jit- 
ter below 2°-3°. 


Frequency Relationships 


(M) NTSC, NTSC-J 

As shown in Chapter 8, there is a defined rela- 
tionship between the subcarrier frequency 
(Fsc) and the line frequency (Fy): 


Fs¢/Fy = 910/4 


Assuming (for example only) a 13.5-MHz 
sample clock rate (Fs): 


Fy = 858 Fy 


Combining these equations produces the 
relationship between Foc and Fs: 


Fs¢/Fs = 35/132 


which may also be expressed in terms of the 
sample clock period (Ts) and the subcarrier 
period (Tc): 


Ts/Tsc¢ = 35/132 


The color subcarrier phase must be 
advanced by this fraction of a subcarrier cycle 
each sample clock. 


(B, D, G, H, I, N) PAL 

As shown in Chapter 8, there is a defined rela- 
tionship between the subcarrier frequency 
(Foc) and the line frequency (Fy): 


Foc/Fy = (1135/4) + (1/625) 


Assuming (for example only) a 13.5-MHz 
sample clock rate (Fs): 


Fy = 864 Fy 


Combining these equations produces the 
relationship between Foc and Fs: 


Fsc/Fs = 709379/2160000 


which may also be expressed in terms of the 
sample clock period (Ts) and the subcarrier 
period (Tsc): 


Ts/Tsc = 709379/2160000 


The color subcarrier phase must be 
advanced by this fraction of a subcarrier cycle 
each sample clock. 


(Nc) PAL 

In the (Nc) PAL video standard used in Argen- 
tina, there is a different relationship between 
the subcarrier frequency (Fsc) and the line 
frequency (Fy): 
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Foc/Fy = (917/4) + (1/625) 


Assuming (for example only) a 13.5-MHz 
sample clock rate (Fs): 


Fy = 864 Fy 


Combining these equations produces the 
relationship between Foc and Fs: 


Fg¢/Fg = 573129/2160000 


which may also be expressed in terms of the 
sample clock period (Ts) and the subcarrier 
period (Tsc): 


Ts/Tsc = 573129/2160000 


The color subcarrier phase must be 
advanced by this fraction of a subcarrier cycle 
each sample clock. 


Quadrature Subcarrier Generation 

A DTO consists of an accumulator in which a 
smaller number [p] is added modulo to 
another number [q]. The counter consists of 
an adder and a register as shown in Figure 
9.16. The contents of the register are con- 
strained so that if they exceed or equal [q], [q] 
is subtracted from the contents. The output 
signal (Xj) of the adder is: 


Xy = Ky_1 + p) modulo q 


With each clock cycle, [p] is added to pro- 
duce a linearly increasing series of digital val- 
ues. It is important that [q] not be an integer 
multiple of [p] so that the generated values are 
continuously different and the remainder 
changes from one cycle to the next. 


MODULO Q 
OUTPUT 
P REGISTER 


Figure 9.16. Single Stage DTO. 


The DTO is used to reduce the sample 
clock frequency, Fs, to the color subcarrier fre- 
quency, Fsc: 


Fgc = (p/q) Fs 


Since [p] is of finite word length, the DTO out- 
put frequency can be varied only in steps. With 
a [p] word length of [w], the lowest [p] step is 
0.5w and the lowest DTO frequency step is: 


Foc = Fo/2” 


Note that the output frequency cannot be 
greater than half the input frequency. This 
means that the output frequency Foc can only 
be varied by the increment [p] and within the 
range: 


0< Foc < Fs/2 


In this application, an overflow corresponds to 
the completion of a full cycle of the subcarrier. 
Since only the remainder (which repre- 
sents the subcarrier phase) is required, the 
number of whole cycles completed is of no 
interest. During each clock cycle, the output of 
the [q] register shows the relative phase of a 
subcarrier frequency in qths of a subcarrier 
period. By using the [q] register contents to 
address a ROM containing a sine wave charac- 
teristic, a numerical representation of the sam- 
pled subcarrier sine wave can be generated. 
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Single Stage DTO 

A single 24-bit or 32-bit modulo [q] register 
may be used, with the 11 most significant bits 
providing the subcarrier reference phase. An 
example of this architecture is shown in Figure 
9.16. 


Multi-Stage DTO 

More long-term accuracy may be achieved if 
the ratio is partitioned into two or three frac- 
tions, the more significant of which provides 
the subcarrier reference phase, as shown in 
Figure 9.17. 

To use the full capacity of the ROM and 
make the overflow automatic, the denominator 
of the most significant fraction is made a power 
of two. The 4x HCOUNT denominator of the 
least significant fraction is used to simplify 
hardware calculations. 

Subdividing the subcarrier period into 
2048 phase steps, and using the total number 
of samples per scan line (HCOUNT), the ratio 
may be partitioned as follows: 


(P2) 
FSC _ (4)(HCOUNT) 
FS 2048 


P1 and P2 are programmed to generate the 
desired color subcarrier frequency (Fc). The 
modulo 4x HCOUNT and modulo 2048 
counters should be reset at the beginning of 
each vertical sync of field one to ensure the 
generation of the correct subcarrier reference 
(as shown in Figures 8.5 and 8.16). 


The less significant stage produces a 
sequence of carry bits which correct the 
approximate ratio of the upper stage by alter- 
ing the counting step by one: from P1 to P1+1. 
The upper stage produces an 11-bit subcarrier 
phase used to address the sine and cosine 
ROMs. 

Although the upper stage adder automati- 
cally overflows to provide modulo 2048 opera- 
tion, the lower stage requires additional 
circuitry because 4x HCOUNT may not be 
(and usually isn’t) an integer power of two. In 
this case, the 16-bit register has a maximum 
capacity of 65535 and the adder generates a 
carry for any value greater than this. To pro- 
duce the correct carry sequence, it is neces- 
sary, each time the adder overflows, to adjust 
the next number added to make up the differ- 
ence between 65535 and 4x HCOUNT. This 
requires: 


P3 = 65536 — (4) (HCOUNT) + P2 


Although this changes the contents of the 
lower stage register, the sequence of carry bits 
is unchanged, ensuring that the correct phase 
values are generated. 

The P1 and P2 values are determined for 
(M) NTSC operation using the following equa- 
tion: 


(P2) 
FSC= | (4)(HCOUNT) 
FS 2048 


_ (910 1 ; 
4 JHCOUNT 
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Figure 9.17. 3-Stage DTO Chrominance Subcarrier Generation. 
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The Pi and P2 values are determined for 
(B, D, G, H, I, N) PAL operation using the fol- 
lowing equation: 


P2 ; 
FSC_ | (4)(HCOUNT) 
FS 2048 


= (1 1 1 ) 
4 ~~ 625 )\HCOUNT 


The P1 and P2 values are determined for the 
version of (Nc) PAL used in Argentina using 
the following equation: 


P2 ; 
FSC_. (4)(HCOUNT) 
FS 2048 


-7 1 1 ) 
~ 4 © 625)\HCOUNT 


The modulo 625 counter, with a [p] value 
of 67, is used during 625-line operation to more 
accurately adjust subcarrier generation due to 
the 0.1072 remainder after calculating the P1 
and P2 values. During 525-line operation, the 
carry signal should always be forced to be 
zero. Table 9.5 lists some of the common hori- 
zontal resolutions, sample clock rates, and 
their corresponding HCOUNT, P1, and P2 val- 
ues. 


Sine and Cosine Generation 

Regardless of the type of DTO used, each 
value of the 11-bit subcarrier phase corre- 
sponds to one of 2048 waveform values taken 
at a particular point in the subcarrier cycle 
period and stored in ROM. The sample points 
are taken at odd multiples of one 4096th of the 
total period to avoid end-effects when the sam- 
ple values are read out in reverse order. 


Note only one quadrant of the subcarrier 
wave shape is stored in ROM, as shown in Fig- 
ure 9.18. The values for the other quadrants 
are produced using the symmetrical properties 
of the sinusoidal waveform. The maximum 
phase error using this technique is +0.09° (half 
of 360/2048), which corresponds to a maxi- 
mum amplitude error of +0.08%, relative to the 
peak-to-peak amplitude, at the steepest part of 
the sine wave signal. 

Figure 9.17 also shows a technique for 
generating quadrature subcarriers from an 11- 
bit subcarrier phase signal. It uses two ROMs 
to store quadrants of sine and cosine wave- 
forms. XOR gates invert the addresses for gen- 
erating time-reversed portions of the 
waveforms and to invert the output polarity to 
make negative portions of the waveforms. An 
additional gate is provided in the sign bit for 
the V subcarrier to allow injection of a PAL 
Switch square wave to implement phase inver- 
sion of the V signal on alternate scan lines. 


Horizontal and Vertical Timing 


Vertical and horizontal counters are used to 
control the video timing. 


Timing Control 

To control the horizontal and vertical counters, 
separate horizontal sync (HSYNC#) and verti- 
cal sync (VSYNC#) signals are commonly 
used. A BLANK# control signal is usually used 
to indicate when to generate active video. 

If HSYNC#, VSYNC#, and BLANK# are 
inputs, controlling the horizontal and vertical 
counters, this is referred to as “slave” timing. 
HSYNC#, VSYNC#, and BLANK# are gener- 
ated by another device in the system, and used 
by the encoder to generate the video. 
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4096TH'S OF A SUBCARRIER PERIOD 


Figure 9.18. Positions of the 512 Stored Sample Values 
in the sin and cos ROMs for One Quadrant of a 
Subcarrier Cycle. Samples for other quadrants are 
generated by inverting the addresses and/or sign 


values. 
F Total Samples 
Typical é 4x 
iat per Scan Line P1 P2 
Application (HCOUNT) HCOUNT 
13.5 MHz (M) NTSC 858 3432 543 104 
13.5 MHz (B, D, G, H, I) PAL 864 3456 672 2061 
12.27 MHz (M) NTSC 780 3120 597 1040 
14.75 MHz (B, D, G, H, I) PAL 944 3776 615 2253 


Table 9.5. Typical HCOUNT, P1, and P2 Values for the 3-Stage DTO in 


Figure 9.17. 
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The horizontal and vertical counters may 
also be used to generate the basic video tim- 
ing. In this case, referred to as “master” tim- 
ing, HSYNC#, VSYNC#, and BLANK# are 
outputs from the encoder, and used elsewhere 
in the system. 

For a BT.656 or VIP video interface, hori- 
zontal blanking (H), vertical blanking (V), and 
field (F) information are used. In this applica- 
tion, the encoder would use the H, V, and F 
timing bits directly, rather than depending on 
HSYNC#, VSYNC#, and BLANK# control sig- 
nals. 

Table 9.6 lists the typical horizontal blank 
timing for common sample clock rates. A 
blanking control signal (BLANK#) is used to 
specify when to generate active video. 


Horizontal Timing 

An 11-bit horizontal counter is incremented on 
each rising edge of the sample clock, and reset 
by HSYNC#. The counter value is monitored to 
determine when to assert and negate various 
control signals each scan line, such as the start 
of burst envelope, end of burst envelope, etc. 


During “slave” timing operation, if there is 
no HSYNC# pulse at the end of a line, the 
counter can either continue incrementing (rec- 
ommended) or automatically reset (not recom- 
mended). 


Vertical Timing 

A 10-bit vertical counter is incremented on 
each leading edge of HSYNC#, and reset when 
coincident leading edges of VSYNC# and 
HSYNC# occur. Rather than exactly coincident 
falling edges, a “coincident window” of about 
+64 clock cycles should be used to ease inter- 
facing to some video timing controllers. If both 
the HSYNC# and VSYNC# leading edges are 
detected within 64 clock cycles of each other, it 
is assumed to be the beginning of Field 1. The 
counter value is monitored to determine which 
scan line is being generated. 

For interlaced (M) NTSC, color burst 
information should be disabled on scan lines 
1-9 and 264-272, inclusive. On the remaining 
scan lines, color burst information should be 
enabled and disabled at the appropriate hori- 
zontal count values. 


Typical Sync + Back Porch ae ne 

Application Blanking (Samples) (Samples) 
13.5 MHz (M) NTSC 122 16 
13.5 MHz (B, D, G, H, I) PAL 132 12 
12.27 MHz (M) NTSC 118 22 
14.75 MHz (B, D, G, H, I) PAL 155 21 


Table 9.6. Typical BLANK# Horizontal Timing. 
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For noninterlaced (M) NTSC, color burst 
information should be disabled on scan lines 
1-9, inclusive. A 29.97 Hz (30/1.001) offset 
may be added to the color subcarrier fre- 
quency so the subcarrier phase will be 
inverted from field to field. On the remaining 
scan lines, color burst information should be 
enabled and disabled at the appropriate hori- 
zontal count values. 

For interlaced (B, D, G, H, I, N, Nc) PAL, 
during fields 1, 2, 5, and 6, color burst informa- 
tion should be disabled on scan lines 1-6, 310- 
318, and 623-625, inclusive. During fields 3, 4, 
7, and 8, color burst information should be dis- 
abled on scan lines 1-5, 311-319, and 622-625, 
inclusive. On the remaining scan lines, color 
burst information should be enabled and dis- 
abled at the appropriate horizontal count val- 
ues. 
For noninterlaced (B, D, G, H, I, N, No) 
PAL, color burst information should be dis- 
abled on scan lines 1-6 and 310-312, inclusive. 
On the remaining scan lines, color burst infor- 
mation should be enabled and disabled at the 
appropriate horizontal count values. 

For interlaced (M) PAL, during fields 1, 2, 
5, and 6, color burst information should be dis- 
abled on scan lines 1-8, 260-270, and 523-525, 
inclusive. During fields 3, 4, 7, and 8, color 
burst information should be disabled on scan 
lines 1-7, 259-269, and 522-525, inclusive. On 
the remaining scan lines, color burst informa- 
tion should be enabled and disabled at the 
appropriate horizontal count values. 

For noninterlaced (M) PAL, color burst 
information should be disabled on scan lines 
1-8 and 260-262, inclusive. On the remaining 
scan lines, color burst information should be 
enabled and disabled at the appropriate hori- 
zontal count values. 


Early PAL receivers produced colored 
“twitter” at the top of the picture due to the 
swinging burst. To fix this, Bruch blanking 
was implemented to ensure that the phase of 
the first burst is the same following each verti- 
cal sync pulse. Analog encoders used a “mean- 
der gate” to control the burst reinsertion time 
by shifting one line at the vertical field rate. A 
digital encoder simply keeps track of the scan 
line and field number. Modern receivers do not 
require Bruch blanking, but it is useful for 
determining which field is being processed. 

During “slave” timing operation, if there is 
no VSYNC# pulse at the end of a frame, the 
counter can either continue incrementing (rec- 
ommended) or automatically reset (not recom- 
mended). 

During “master” timing operation, for pro- 
video applications, it may be desirable to gen- 
erate 2.5 scan line VSYNC# pulses during 625- 
line operation. However, this may cause Field 1 
vs. Field 2 detection problems in some com- 
mercially available video chips. 


Field ID Signals 

Although the timing relationship between 
HSYNC# and VSYNC#, or the BT:656 and VIP 
F bit, is used to specify Field 1 or Field 2, addi- 
tional signals may be used to specify which one 
of four or eight fields to generate, as shown in 
Table 9.7. 

FIELD_0 should change state coincident 
with the leading edge of VSYNC# during fields 
1, 3, 5, and 7. FIELD_1 should change state 
coincident with the leading edge of VSYNC# 
during fields 1 and 5. 

For BT.656 or VIP video interfaces, 
FIELD_0 and FIELD_1 may be transmitted 
using ancillary data. 
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Clean Encoding 


Typically, the only filters present in a conven- 
tional encoder are the color difference lowpass 
filters. This results in considerable spectral 
overlap between the luminance and chromi- 
nance components, making it impossible to 
separate the signals completely at the decoder. 

However, additional processing at the 
encoder can be used to reduce cross-color 
(luminance-to-chrominance crosstalk) and 
cross-luminance (chrominance-to-luminance 
crosstalk) decoder artifacts. Cross-color 
appears as a coarse rainbow pattern or random 
colors in regions of fine detail. Cross-lumi- 
nance appears as a fine pattern on chromi- 
nance edges. 

Cross-color in a decoder may be reduced 
by removing some of the high-frequency lumi- 
nance data in the encoder, using a notch filter 


at Foc. However, while reducing the cross- 
color, luminance detail is lost. 

A better method is to pre-comb filter the 
luminance and chrominance information in the 
encoder (see Figure 9.19). High-frequency 
luminance information is precombed to mini- 
mize interference with chrominance frequen- 
cies in that spectrum. Chrominance 
information also is pre-combed by averaging 
over a number of lines, reducing cross-lumi- 
nance or the “hanging dot” pattern. 

This technique allows fine, moving lumi- 
nance (which tends to generate cross-color at 
the decoder) to be removed while retaining full 
resolution for static luminance. However, there 
is a small loss of diagonal luminance resolution 
due to it being averaged over multiple lines. 
This is offset by an improvement in the 
chrominance signal-to-noise ratio (SNR). 


FIELD 2 | FIELDO | "Timing Relationship Field Field 
or BT.656 F Bit Number Number 
0 0 field 1 1 odd field 1 even field 
0 0 field 2 2 even field 2 odd field 
0 1 field 1 3 odd field 3 even field 
0 1 field 2 4 even field 4 odd field 
1 0 field 1 - - 5 even field 
1 0 field 2 - - 6 odd field 
1 1 field 1 - - 7 even field 
1 1 field 2 - - 8 odd field 


Table 9.7. Field Numbering. 


398 Chapter 9: NTSC/PAL Digital Encoding and Decoding 


LPF 
—=| NTSC = 2.3MHZ 


PAL = 3.1 MHZ 


COMPLEMENTARY FILTERS 


HPF 
L| NTSC = 2.3MHZ -——#| 
PAL = 3.1 MHZ 


LUMINANCE 
COMB FILTER 


MODULATOR 


[-—— 


CHROMINANCE 
COMB FILTER 


o— NTSC / PAL 


Figure 9.19. Clean Encoding Example. 


Bandwidth-Limited Edge Generation 


Smooth syne and blank edges may be gener- 
ated by integrating a T, or raised cosine, pulse 
to generate a T step (Figure 9.20). NTSC sys- 
tems use a T pulse with T = 125 ns; therefore, 
the 2T step has little signal energy beyond 4 
MHz. PAL systems use a T pulse with T = 100 
ns; in this instance, the 2T step has little signal 
energy beyond 5 MHz. 


— 125 NS 


> TIME (NS) 
125 0 125 


(A) 


The T step provides a fast risetime, without 
ringing, within a well-defined bandwidth. The 
risetime of the edge between the 10% and 90% 
points is 0.964T: By choosing appropriate sam- 
ple values for the sync edges, blanking edges, 
and burst envelope, these values can be stored 
in a small ROM, which is triggered at the 
appropriate horizontal count. By reading the 
contents of the ROM forward and backward, 
both rising and falling edges may be gener- 
ated. 


> TIME (NS) 
125 0 125 


Figure 9.20. Bandwidth-limited Edge Generation. (a) NTSC T pulse. (b) The 
T step, the result of integrating the T pulse. 
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Level Limiting 


Certain highly saturated colors produce com- 
posite video levels that may cause problems in 
downstream equipment. 

Invalid video levels greater than 100 IRE or 
less than —20 IRE (relative to the blank level) 
may be transmitted, but may cause distortion 
in VCRs or demodulators and cause sync sepa- 
ration problems. 

Illegal video levels greater than 120 IRE 
(NTSC) or 133 IRE (PAL), or below the sync 
tip level, may not be transmitted. 

Although usually not a problem in a con- 
ventional video application, computer systems 
commonly use highly saturated colors, which 
may generate invalid or illegal video levels. It 
may be desirable to optionally limit these sig- 
nal levels to around 110 IRE, compromising 
between limiting the available colors and gen- 
erating legal video levels. 

One method of correction is to adjust the 
luminance or saturation of invalid and illegal 
pixels until the desired peak limits are 
attained. Alternately, the frame buffer contents 
may be scanned, and pixels flagged that would 
generate an invalid or illegal video level (using 
a separate overlay plane or color change). The 
user then may change the color to a more suit- 
able one. 

In a professional editing application, the 
option of transmitting all the video information 
(including invalid and illegal levels) between 
equipment is required to minimize editing and 
processing artifacts. 


Encoder Video Parameters 


Many industry-standard video parameters 
have been defined to specify the relative qual- 
ity of NTSC and PAL encoders. To measure 
these parameters, the output of the encoder 
(while generating various video test signals 


such as those described in Chapter 8) is moni- 
tored using video test equipment. Along with a 
description of several of these parameters, typ- 
ical AC parameter values for both consumer 
and studio-quality encoders are shown in Table 
9.8. 

Several AC parameters, such as group 
delay and K factors, are dependent on the qual- 
ity of the output filters and are not discussed 
here. In addition to the AC parameters dis- 
cussed in this section, there are several others 
that should be included in an encoder specifi- 
cation, such as burst frequency and tolerance, 
horizontal frequency, horizontal blanking time, 
sync rise and fall times, burst envelope rise 
and fall times, video blanking rise and fall 
times, and the bandwidths of the YIQ or YUV 
components. 

There are also several DC parameters 
(such as white level and tolerance, blanking 
level and tolerance, sync height and tolerance, 
peak-to-peak burst amplitude and tolerance) 
that should be specified, as shown in Table 9.9. 


Differential Phase 

Differential phase distortion, commonly 
referred to as differential phase, specifies how 
much the chrominance phase is affected by 
the luminance level—in other words, how 
much hue shift occurs when the luminance 
level changes. Both positive and negative 
phase errors may be present, so differential 
phase is expressed as a peak-to-peak measure- 
ment, expressed in degrees of subcarrier 
phase. 

This parameter is measured using chroma 
of uniform phase and amplitude superimposed 
on different luminance levels, such as the mod- 
ulated ramp test signal or the modulated 5-step 
portion of the composite test signal. The differ- 
ential phase parameter for a studio-quality 
encoder may approach 0.2° or less. 
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Consumer Quality 


Studio Quality 


Parameter Units 
NTSC PAL NTSC PAL 

differential phase 4 <1 degrees 
differential gain 4 <1 % 
luminance nonlinearity 2 <1 % 
hue accuracy 3 <1 degrees 
color saturation accuracy 3 <1 % 
residual subcarrier 0.5, 0.1 IRE 
SNR (per EIA-250-C) 48 > 60 dB 
SCH phase 0 +40 0 +20 0 +2 degrees 
analog Y/C output skew 5 <2 ns 

H tilt <1 <1 % 

V tilt <1 <1 % 
subcarrier tolerance 10 5 10 5 Hz 


Table 9.8. Typical AC Video Parameters for (M) NTSC and (B, D, G, H, I) PAL Encoders. 


Consumer Quality 


Studio Quality 


Parameter Units 
NTSC PAL NTSC PAL 
white relative to blank 714 +70 700 +70 714 47 700 +7 mV 
black relative to blank 54 +5 0 54 +0.5 0 mV 
sync relative to blank —286 +30 -300 +30 286 +3 -300 +3 mV 
burst amplitude 286 +30 300 +30 286 +3 300 +3 mV 


Table 9.9. Typical DC Video Parameters for (M) NTSC and (B, D, G, H, 1) PAL Encoders. 
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Differential Gain 
Differential gain distortion, commonly 
referred to as differential gain, specifies how 
much the chrominance gain is affected by the 
luminance level—in other words, how much 
color saturation shift occurs when the lumi- 
nance level changes. Both attenuation and 
amplification may occur, so differential gain is 
expressed as the largest amplitude change 
between any two levels, expressed as a per- 
centage of the largest chrominance amplitude. 
This parameter is measured using chroma 
of uniform phase and amplitude superimposed 
on different luminance levels, such as the mod- 
ulated ramp test signal or the modulated 5-step 
portion of the composite test signal. The differ- 
ential gain parameter for a _studio-quality 
encoder may approach 0.2% or less. 


Luminance Nonlinearity 
Luminance nonlinearity, also referred to as dif- 
ferential luminance and luminance nonlinear 
distortion, specifies how much the luminance 
gain is affected by the luminance level—in 
other words, a nonlinear relationship between 
the generated and ideal luminance levels. 
Using an unmodulated 5-step or 10-step 
staircase test signal, the difference between 
the largest and smallest steps, expressed as a 
percentage of the largest step, is used to spec- 
ify the luminance nonlinearity. Although this 
parameter is included within the differential 
gain and phase parameters, it is traditionally 
specified independently. 


Chrominance Nonlinear Phase Distortion 
Chrominance nonlinear phase distortion speci- 
fies how much the chrominance phase (hue) is 
affected by the chrominance amplitude (satu- 
ration)—in other words, how much hue shift 
occurs when the saturation changes. 


Using a modulated pedestal test signal, or 
the modulated pedestal portion of the combi- 
nation test signal, the phase differences 
between each chrominance packet and the 
burst are measured. The difference between 
the largest and the smallest measurements is 
the peak-to-peak value, expressed in degrees 
of subcarrier phase. This parameter is usually 
not independently specified, but is included 
within the differential gain and phase parame- 
ters. 


Chrominance Nonlinear Gain Distortion 
Chrominance nonlinear gain distortion speci- 
fies how much the chrominance gain is 
affected by the chrominance amplitude (satu- 
ration)—in other words, a nonlinear relation- 
ship between the generated and _ ideal 
chrominance amplitude levels, usually seen as 
an attenuation of highly saturated chromi- 
nance signals. 

Using a modulated pedestal test signal, or 
the modulated pedestal portion of the combi- 
nation test signal, the test equipment is 
adjusted so that the middle chrominance 
packet is 40 IRE. The largest difference 
between the measured and nominal values of 
the amplitudes of the other two chrominance 
packets specifies the chrominance nonlinear 
gain distortion, expressed in IRE or as a per- 
centage of the nominal amplitude of the worst- 
case packet. This parameter is usually not 
independently specified, but is included within 
the differential gain and phase parameters. 


Chrominance-to-Luminance 
Intermodulation 

Chrominance-to-luminance intermodulation, 
commonly referred to as cross-modulation, 
specifies how much the luminance level is 
affected by the chrominance. This may be the 
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result of clipping highly saturated chromi- 
nance levels or quadrature distortion and may 
show up as irregular brightness variations due 
to changes in color saturation. 

Using a modulated pedestal test signal, or 
the modulated pedestal portion of the combi- 
nation test signal, the largest difference 
between the ideal 50 IRE pedestal level and the 
measured luminance levels (after removal of 
chrominance information) specifies the 
chrominance-to-luminance intermodulation, 
expressed in IRE or as a percentage. This 
parameter is usually not independently speci- 
fied, but is included within the differential gain 
and phase parameters. 


Hue Accuracy 

Hue accuracy specifies how closely the gener- 
ated hue is to the ideal hue value. Both positive 
and negative phase errors may be present, so 
hue accuracy is the difference between the 
worst-case positive and worst-case negative 
measurements from nominal, expressed in 
degrees of subcarrier phase. This parameter is 
measured using EIA or EBU 75% color bars as 
atest signal. 


Color Saturation Accuracy 

Color saturation accuracy specifies how 
closely the generated saturation is to the ideal 
saturation value, using EIA or EBU 75% color 
bars as a test signal. Both gain and attenuation 
may be present, so color saturation accuracy is 
the difference between the worst-case gain and 
worst-case attenuation measurements from 
nominal, expressed as a percentage of nomi- 
nal. 


Residual Subcarrier 

The residual subcarrier parameter specifies 
how much subcarrier information is present 
during white or gray (note that, ideally, none 


should be present). Excessive residual subcar- 
rier is visible as noise during white or gray por- 
tions of the picture. 

Using an unmodulated 5-step or 10-step 
staircase test signal, the maximum peak-to- 
peak measurement of the — subcarrier 
(expressed in IRE) during active video is used 
to specify the residual subcarrier relative to 
the burst amplitude. 


SCH Phase 

SCH (Subcarrier to Horizontal) phase refers to 
the phase relationship between the leading 
edge of horizontal sync (at the 50% amplitude 
point) and the zero crossings of the color burst 
(by extrapolating the color burst to the leading 
edge of sync). The error is referred to as SCH 
phase and is expressed in degrees of subcar- 
rier phase. 

For PAL, the definition of SCH phase is 
slightly different due to the more complicated 
relationship between the sync and subcarrier 
frequencies—the SCH phase relationship for a 
given line repeats only once every eight fields. 
Therefore, PAL SCH phase is defined, per 
EBU Technical Statement D 23-1984 (E), as 
“the phase of the +U component of the color 
burst extrapolated to the half-amplitude point 
of the leading edge of the synchronizing pulse 
of line 1 of field 1.” 

SCH phase in important when merging 
two or more video signals. To avoid color shifts 
or “picture jumps,” the video signals must have 
the same horizontal, vertical, and subcarrier 
timing and the phases must be closely 
matched. To achieve these timing constraints, 
the video signals must have the same SCH 
phase relationship since the horizontal sync 
and subcarrier are continuous signals with a 
defined relationship. It is common for an 
encoder to allow adjustment of the SCH phase 
to simplify merging two or more video signals. 
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Maintaining proper SCH phase is also impor- 
tant since NTSC and PAL decoders may moni- 
tor the SCH phase to determine which color 
field is being decoded. 


Analog Y/C Video Output Skew 

The output skew between the analog lumi- 
nance (Y) and chrominance (C) video signals 
should be minimized to avoid phase shift 
errors between the luminance and chromi- 
nance information. Excessive output skew is 
visible as artifacts along sharp vertical edges 
when viewed on a monitor. 


H Tilt 

H tilt, also known as line tilt and line time dis- 
tortion, causes a tilt in line-rate signals, pre- 
dominantly white bars. This type of distortion 
causes variations in brightness between the 
left and right edges of an image. For a digital 
encoder, such as that described in this chapter, 
H tilt is primarily an artifact of the analog out- 
put filters and the transmission medium. 

H tilt is measured using a line bar (such as 
the one in the NTC-7 NTSC composite test sig- 
nal) and measuring the peak-to-peak deviation 
of the tilt (in IRE or percent of white bar ampli- 
tude), ignoring the first and last microsecond 
of the white bar. 


V Tilt 
V tilt, also known as field tilt and field time dis- 
tortion, causes a tilt in field-rate signals, pre- 
dominantly white bars. This type of distortion 
causes variations in brightness between the 
top and bottom edges of an image. For a digital 
encoder, such as that described in this chapter, 
V tilt is primarily an artifact of the analog out- 
put filters and the transmission medium. 

V tilt is measured using a 18 us, 100 IRE 
white bar in the center of 130 lines in the cen- 
ter of the field or using a field square wave. 


The peak-to-peak deviation of the tilt is mea- 
sured (in IRE or percent of white bar ampli- 
tude), ignoring the first three and last three 
lines. 


Genlocking Support 


In many instances, it is desirable to be able to 
genlock the output (align the timing signals) of 
an encoder to another composite analog video 
signal to facilitate downstream video process- 
ing. This requires locking the horizontal, verti- 
cal, and color subcarrier frequencies and 
phases together, as discussed in the NTSC/ 
PAL decoder section of this chapter. In addi- 
tion, the luminance and chrominance ampli- 
tudes must be matched. A major problem in 
genlocking is that the regenerated sample 
clock may have excessive jitter, resulting in 
color artifacts. 

One genlocking variation is to send an 
advance house sync (also known as _ black 
burst or advance sync) to the encoder. The 
advancement compensates for the delay from 
the house sync generator to the encoder out- 
put being used in the downstream processor, 
such as a mixer. Each video source has its own 
advanced house sync signal, so each video 
source is time-aligned at the mixing or pro- 
cessing point. 

Another genlocking option allows adjust- 
ment of the subcarrier phase so it can be 
matched with other video sources at the mix- 
ing or processing point. The subcarrier phase 
must be able to be adjusted from 0° to 360°. 
Either zero SCH phase is always maintained or 
another adjustment is allowed to indepen- 
dently position the sync and luminance infor- 
mation in about 10 ns steps. 

The output delay variation between prod- 
ucts should be within about +0.8 ns to allow 
video signals from different genlocked devices 
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to be properly mixed. Mixers usually assume 
the two video signals are perfectly genlocked, 
and excessive time skew between the two 
video signals results in poor mixing perfor- 
mance. 


Alpha Channel Support 


An encoder designed for pro-video editing 
applications may support an alpha channel. 
Eight or ten bits of digital alpha data are input, 
pipelined to match the pipeline of the encoding 
process, and converted to an analog alpha sig- 
nal (discussed in Chapter 7). Alpha is usually 
linear, with the data generating an analog alpha 
signal (also called a key) with a range of 0-100 
IRE. There is no blanking pedestal or sync 
information present. 

In computer systems that support 32-bit 
pixels, 8 bits are typically available for alpha 
information. 


NTSC and PAL Digital 
Decoding 


Although the luminance and chrominance 
components in a NTSC/PAL encoder are usu- 
ally combined by simply adding the two signals 
together, separating them in a decoder is much 
more difficult. Analog NTSC and PAL decod- 
ers have been around for some time. However, 
they have been difficult to use, required adjust- 
ment, and offered limited video quality. 

Using digital techniques to implement 
NTSC and PAL decoding offers many advan- 
tages, such as ease of use, minimum analog 
adjustments, and excellent video quality. The 
use of digital circuitry also enables the design 
of much more robust and sophisticated Y/C 
separator and genlock implementations. 


A general block diagram of a NTSC/PAL 
digital decoder is shown in Figure 9.21. 


Digitizing the Analog Video 


The first step in digital decoding of composite 
video signals is to digitize the entire composite 
video signal using an A/D converter (ADC). 
For our example, 10-bit ADCs are used; there- 
fore, indicated values are 10-bit values. 

The composite and S-video signals are 
illustrated in Figures 9.2, 9.3. 9.10, 9.11, 9.12, 
and 9.13. 

Video inputs are usually AC-coupled and 
have a 75-Q AC and DC input impedance. As a 
result, the video signal must be DC restored 
every scan line during horizontal sync to posi- 
tion the sync tips at a known voltage level. 

The video signal must also be lowpass fil- 
tered (typically to about 6 MHz) to remove any 
high-frequency components that may result in 
aliasing. Although the video bandwidth for 
broadcast is rigidly defined, there is no stan- 
dard for consumer equipment. The video 
source generates as much bandwidth as it can; 
the receiving equipment accepts as much 
bandwidth as it can process. 

Video signals with amplitudes of 0.25x to 
2x ideal are common in the consumer market. 
The active video and/or sync signal may 
change amplitude, especially in editing situa- 
tions where the video signal may be composed 
of several different video sources merged 
together. 

In addition, the decoder should be able to 
handle 100% colors. Although only 75% colors 
may be broadcast, there is no such limitation 
for baseband video. With the frequent use of 
computer-generated text and graphics, highly- 
saturated colors are becoming more common. 
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Figure 9.21. Typical NTSC/PAL Digital Decoder Implementation. 
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DC Restoration 

To remove any DC offset that may be present 
in the video signal, and position it at a known 
level, DC restoration (also called clamping) is 
done. 

For composite or luminance (Y) video sig- 
nals, the analog video signal is DC restored to 
the REF- voltage of the ADC during each hori- 
zontal sync time. Thus, the ADC generates a 
code of 0 during the sync level. 

For chrominance (C) video signals, the 
analog video signal is DC restored to the mid- 
point of the ADC during the horizontal sync 
time. Thus, the ADC generates a code of 512 
during the blanking level. 


Automatic Gain Control 

An automatic gain control (AGC) is used to 
ensure that a constant value for the blanking 
level is generated by the ADC. If the blanking 
level is low or high, the video signal is ampli- 
fied or attenuated until the blanking level is 
correct. 

In S-video applications, the same amount of 
gain that is applied to the luminance video sig- 
nal should also be applied to the chrominance 
video signal. 

After DC restoration and AGC processing, 
an offset of 16 is added to the digitized com- 
posite and luminance signals to match the lev- 
els used by the encoder. 

Tables 9.2, 9.3, and 9.4 show the ideal ADC 
values for composite and s-video sources after 
DC restoration and automatic gain control has 
been done. 


Blank Level Determination 

The most common method of determining the 
blanking level is to digitally lowpass filter the 
video signal to about 0.5 MHz to remove sub- 
carrier information and noise. The back porch 
is then sampled multiple times to determine an 
average blank level value. 


To limit line-to-line variations and clamp 
streaking (the result of quantizing errors), the 
result should be averaged over 3-32 consecu- 
tive scan lines. Alternately, the back porch 
level may be determined during the vertical 
blanking interval and the result used for the 
entire field. 


Video Gain Options 

The difference from the ideal blanking level is 
processed and used in one of several ways to 
generate the correct blanking level: 


(a) controlling a voltage-controlled amplifier 
(b) adjusting the REF+ voltage of the ADC 
(c) multiplying the outputs of the ADC 


In (a) and (b), an analog signal for control- 
ling the gain may be generated by either a 
DAC or a charge pump. If a DAC is used, it 
should have twice the resolution of the ADC to 
avoid quantizing noise. For this reason, a 
charge pump implementation may be more 
suitable. 

Option (b) is dependent on the ADC being 
able to operate over a wide range of reference 
voltages, and is therefore rarely implemented. 

Option (c) is rarely used due to the result- 
ing quantization errors from processing in the 
digital domain. 


Sync Amplitude AGC 

This is the most common mode of AGC, and is 
used where the characteristics of the video sig- 
nal are not known. The difference between the 
measured and the ideal blanking level is used 
to determine how much to increase or 
decrease the gain of the entire video signal. 
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Burst Amplitude AGC 

Another method of AGC is based on the color 
burst amplitude. This is commonly used in pro- 
video applications when the sync amplitude 
may not be related to the active video ampli- 
tude. 

First, the blanking level is adjusted to the 
ideal value, regardless of the sync tip position. 
This may be done by adding or subtracting a 
DC offset to the video signal. 

Next, the burst amplitude is determined. 
To limit line-to-line variations, the burst ampli- 
tude may be averaged over 3-32 consecutive 
scan lines. 

The difference between the measured and 
the ideal burst amplitude is used to determine 
how much to increase or decrease the gain of 
the entire video signal. During the gain adjust- 
ment, the blanking value should not change. 


AGC Options 
For some pro-video applications, such as if the 


video signal levels are known to be correct, if 
all the video levels except the sync height are 
correct, or if there is excessive noise in the 
video signal, it may be desirable to disable the 
automatic gain control. 

The AGC value to use may be specified by 
the user, or the AGC value frozen once deter- 
mined. 


Y/C Separation 


When decoding composite video, the lumi- 
nance (Y) and chrominance (C) must be sepa- 
rated. The many techniques for doing this are 
discussed in detail later in the chapter. 

After Y/C separation, Y has the nominal 
values shown in Table 9.2. Note that the lumi- 
nance still contains sync and blanking informa- 
tion. Modulated chrominance has the nominal 
values shown in Table 9.3. 


The quality of Y/C separation is a major 
factor in the overall video quality generated by 
the decoder. 


Color Difference Processing 


Chrominance (C) Demodulation 

The chrominance demodulator (Figure 9.22) 
accepts modulated chroma data from either 
the Y/C separator or the chroma ADC. It gen- 
erates CbCr, UV, or IQ color difference data. 


(M) NTSC, NTSC-J 

During active video, the chrominance data is 
demodulated using sin and cos subcarrier 
data, as shown in Figure 9.22, resulting in 
CbCr, UV, or IQ data. For this design, the 11-bit 
reference subcarrier phase (see Figure 9.32) 
and the burst phase are the same (180°). 

For YUV or YCbCr processing, 180° must 
be added to the 11-bit reference subcarrier 
phase during active video time so the output of 
the sin and cos ROMs have the proper subcar- 
rier phases (0° and 90°, respectively). 

For YIQ processing, 213° must be added to 
the 11-bit reference subcarrier phase during 
active video time so the output of the sin and 
cos ROMs have the proper subcarrier phases 
(33° and 123°, respectively). 

For all the equations, 


@ = 2nF sc 
Fyc = 3.579545 MHz 


YUV Color Space Processing 

As shown in Chapter 8, the chrominance signal 
processed by the demodulator may be repre- 
sented by: 


(U sin wt) + (V cos wt) 


408 = Chapter 9: NTSC/PAL Digital Encoding and Decoding 


DIGITIZED 
CHROMINANCE 
(S-VIDEO) 


10 


a 


CHROMINANCE FROM 
Y/C SEPARATOR 


NTSC = (2)(1.406)(COS wT) 


512 


0.6 MHZ 10 


PAL = + (2)(1.329)(COS oT) 


FROM 
SUBCARRIER 
GENERATOR 


10 
+ ~| LowPAss }——_“—» cr 


FILTER 


0.6 MHZ 10 


10 | 
NTSC = (2)(1.984)(SIN wT) 
PAL = (2)(1.876)(SIN oT) a + =) POWEASS: 7 cB 


FILTER 


Figure 9.22. Chrominance Demodulation Example That Generates CbCr Directly. 


U is obtained by multiplying the chromi- 
nance data by [2 sin wt], and V is obtained by 
multiplying by [2 cos wt]: 


((U sin wt) + (V cos wt)) (2 sin ot) 
=U- (U cos 2ot) + (V sin 2at) 


((U sin wt) + (V cos wt)) (2 cos at) 
= V+ (Vcos 2at) + (U sin 2at) 


The 2mt components are removed by low- 
pass filtering, resulting in the U and V signals 
being recovered. The demodulator multipliers 


should ensure overflow and underflow condi- 
tions are saturated to the maximum and mini- 
mum values, respectively. The UV signals are 
then rounded to 9 bits plus sign and lowpass 
filtered. 

For (M) NTSC, U has a nominal range of 0 
to +226, and V has a nominal range of 0 to 
+319. 

For NTSC-J used in Japan, U has a nomi- 
nal range of 0 to +244, and V has a nominal 
range of 0 to +344. 
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YIQ Color Space Processing 

As shown in Chapter 8, for older decoders, the 
chrominance signal processed by the demodu- 
lator may be represented by: 


(Q sin (wt + 33°)) + (cos (at + 33°)) 


The subcarrier generator of the decoder pro- 
vides a 33° phase offset during active video, 
cancelling the 33° phase terms in the equation. 

Q is obtained by multiplying the chromi- 
nance data by [2 sin wt], and I is obtained by 
multiplying by [2 cos wt]: 


((Q sin wt) + (1 cos wt)) (2 sin wt) 
=Q - (Q cos 2ot) + (I sin 2@t) 


((Q sin wt) + (I cos wt)) (2 cos at) 
=1+ (Icos 2mt) + (Q sin 2ot) 


The 2mt components are removed by low- 
pass filtering, resulting in the I and Q signals 
being recovered. The demodulator multipliers 
should ensure overflow and underflow condi- 
tions are saturated to the maximum and mini- 
mum values, respectively. The IQ signals are 
then rounded to 9 bits plus sign and lowpass 
filtered. 

For (M) NTSC, I has a nominal range of 0 
to +309, and Q has a nominal range of 0 to 
+271. 

For NTSC-J used in Japan, I has a nominal 
range of 0 to +334, and Q has a nominal range 
of 0 to +293. 


YCbCr Color Space Processing 

If the decoder is based on the YCbCr color 
space, the chrominance signal may be repre- 
sented by: 


(Cb — 512) (0.504) (sin at) + 
(Cr — 512) (0.711) (cos wt) 


For NTSC-]J systems, the equations are: 


(Cb — 512) (0.545) (sin at) + 
(Cr — 512) (0.769) (cos wt) 


In these cases, the values in the sin and 
cos ROMs are scaled by the reciprocal of the 
indicated values to allow the demodulator to 
generate Cb and Cr data directly, instead of U 
and V data. 


(B, D, G, H, I, M, N, Nc) PAL 

During active video, the digital chrominance 
(C) data is demodulated using sin and cos sub- 
carrier data, as shown in Figure 9.22, resulting 
in CbCr or UV data. For this design, the 11-bit 
reference subcarrier phase (see Figure 9.32) 
and the burst phase are the same (135°). 

For all the equations, 


@ = 2mF sc 


Foc = 4.43361875 MHz 
for (B, D, G, H, I, N) PAL 


Foc = 3.58205625 MHz 
for (Nc) PAL 


Foc = 3.57561149 MHz 
for (M) PAL 


Using a switched subcarrier waveform in 
the Cr or V channel also removes the PAL 
Switch modulation. Thus, [+2 cos wt] is used 
while the PAL Switch is a logical zero (burst 
phase = +135°) and [-2 cos wt] is used while 
the PAL Switch is a logical one (burst phase = 
225°). 


YUV Color Space 
As shown in Chapter 8, the chrominance signal 


is represented by: 


(U sin wt) + (V cos at) 
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U is obtained by multiplying the chrominance 
data by [2 sin wt] and V is obtained by multi- 
plying by [+2 cos at]: 


((U sin wt) + (Vcos wt)) (2 sin wt) 
=U-(Ucos 2at) + (V sin 2ot) 


((U sin wt) + (Vcos wt)) 4 2 cos at) 
=V+(U sin 2ot) + (V cos 2at) 


The 2mt components are removed by low- 
pass filtering, resulting in the U and V signals 
being recovered. The demodulation multipli- 
ers should ensure overflow and underflow con- 
ditions are saturated to the maximum and 
minimum values, respectively. The UV signals 
are then rounded to 9 bits plus sign and low- 
pass filtered. 

For (B, D, G, H, I, Nc) PAL, U has a nomi- 
nal range of 0 to +239, and V has a nominal 
range of 0 to +337. 

For (M, N) PAL, U has a nominal range of 
0 to +226, and V has a nominal range of 0 to 
+319. 


YCbCr Color Space 

If the decoder is based on the YCbCr color 
space, the chrominance signal for (B, D, G, H, 
I, Nc) PAL may be represented by: 


(Cb — 512) (0.533) (sin at) + 
(Cr — 512) (0.752) (cos wt) 


The chrominance signal for (M, N) PAL may 
be represented by: 


(Cb — 512) (0.504) sin wt + 
(Cr — 512) (0.711)cos wt 


In these cases, the values in the sin and 
cos ROMs are scaled by the reciprocal of the 
indicated values to allow the demodulator to 
generate Cb and Cr data directly, instead of U 
and V data. 


Hanover Bars 
If the locally generated subcarrier phase is 
incorrect, a line-to-line pattern known as 
Hanover bars results in which pairs of adjacent 
lines have a real and complementary hue 
error. As shown in Figure 9.23 with an ideal 
color of green, two adjacent lines of the display 
have a hue error (towards yellow), the next 
two have the complementary hue error 
(towards cyan), and so on. 

This can be shown by introducing a phase 
error (6) in the locally generated subcarrier: 


((U sin wt) + (V cos wt)) (2 sin (wt — 6)) 
= (U cos 8) —/+ (V sin 6) 


((U sin wt) + (V cos wt)) (+2 cos (wt — 6)) 
= (V cos 8) +/— (U sin 0) 


In areas of constant color, averaging equal con- 
tributions from even and odd lines (either visu- 
ally or using a delay line), cancels the 
alternating crosstalk component, leaving only 
a desaturation of the true component by [cos 
6]. 


Lowpass Filtering 
The decoder requires sharper roll-off filters 
than the encoder to ensure adequate suppres- 
sion of the sampling alias components. Note 
that with a 13.5-MHz sampling frequency, they 
start to become significant above 3 MHz. 

The demodulation process for (M) NTSC 
is shown spectrally in Figures 9.24 and 9.25; 
the process is similar for PAL. In both figures, 
(a) represents the spectrum of the video signal 
and (b) represents the spectrum of the subcar- 
rier used for demodulation. Convolution of (a) 
and (b), equivalent to multiplication in the time 
domain, produces the spectrum shown in (c), 
in which the baseband spectrum has been 
shifted to be centered about Fc and -Fgc. The 
chrominance is now a baseband signal, which 
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Figure 9.23. Example Display of Hanover Bars. Green is 


the ideal color. 


may be separated from the low-frequency lumi- 
nance, centered at Foc, by a lowpass filter. 

The lowpass filters after the demodulator 
are a compromise between several factors. 
Simply using a 1.3-MHz filter, such as the one 
shown in Figure 9.26, increases the amount of 
cross-color since a greater number of lumi- 
nance frequencies are included. When using 
lowpass filters with a passband greater than 
about 0.6 MHz for NTSC (4.2 — 3.58) or 1.07 
MHz for PAL (5.5 — 4.43), the loss of the upper 
sidebands of chrominance also introduces 
ringing and color difference crosstalk. If a 1.3- 
MHz lowpass filter is used, it may include 
some gain for frequencies between 0.6 MHz 
and 1.3 MHz to compensate for the loss of part 
of the upper sideband. 

Filters with a sharp cutoff accentuate 
chrominance edge ringing; for these reasons 
slow roll-off 0.6-MHz filters, such as the one 
shown in Figure 9.27, are usually used. These 
result in poorer color resolution but minimize 
cross-color, ringing, and color difference 
crosstalk on edges. 


If the decoder is to be used in a pro-video 
editing environment, the filters should have a 
maximum ripple of +0.1 dB in the passband. 
This is needed to minimize the cumulation of 
gain and loss artifacts due to the filters, espe- 
cially when multiple passes through the encod- 
ing and decoding processes are required. 


Luminance (Y) Processing 


To remove the sync and blanking information, 
Y data from either the Y/C separator or the 
luma ADC has the black level subtracted from 
it. At this point, negative Y values should be 
supported to allow test signals, keying infor- 
mation, and real-world video to pass through 
without corruption. 

A notch filter, with a center frequency of 
Foc, is usually optional. It may be used to 
remove any remaining chroma information 
from the Y data. The notch filter is especially 
useful to help “clean up” the Y data when comb 
filtering Y/C separation is used for PAL, due to 
the closeness of the PAL frequency packets. 
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Figure 9.24. Frequency Spectra for NTSC Digital Chrominance Demodulation (F, = 13.5 MHz, Fs¢ 
= 3.58 MHz). (a) Modulated chrominance. (b) Color subcarrier. (c) U and V spectrum produced by 
convolving (a) and (b). 
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Figure 9.25. Frequency Spectra for NTSC Digital Chrominance Demodulation (Fs = 12.27 MHz, Fsc¢ 
= 3.58 MHz). (a) Modulated chrominance. (b) Color subcarrier. (c) U and V spectrum produced by 
convolving (a) and (b). 
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Figure 9.26. Typical 1.3-MHz Lowpass Digital Figure 9.27. Typical 0.6-MHz Lowpass Digital 
Filter Characteristics. Filter Characteristics. 
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User Adjustments 


Contrast, Brightness, and Sharpness 
Programmable contrast, brightness, and 
sharpness adjustments may be implemented, 
as discussed in Chapter 7. In addition, color 
transient improvement may be used _ to 
improve the image quality. 


Hue 
A programmable hue adjustment may be 
implemented, as discussed in Chapter 7. 

Alternately, to reduce circuitry in the data 
path, the hue adjustment is usually imple- 
mented as a subcarrier phase offset that is 
added to the 11-bit reference subcarrier phase 
during the active video time (see Figure 9.32). 
The result is to shift the phase of the sin and 
cos subcarriers by a constant amount. An 11- 
bit hue adjustment allows adjustments in hue 
from 0° to 360°, in increments of 0.176°. 

Due to the alternating sign of the V compo- 
nent in PAL decoders, the sign of the phase off- 
set (8) is set to be the opposite of the V 
component. A negative sign of the phase offset 
(6) is equivalent to adding 180° to the desired 
phase shift. PAL decoders do not usually have 
a hue adjustment feature. 


Saturation 
A programmable saturation adjustment may be 
implemented, as discussed in Chapter 7. 
Alternately, to reduce circuitry in the data 
path, the saturation adjustment may be done 
on the sin and cos values in the demodulator. 
In either case, a “burst level error” signal 
and the user-programmable saturation value 
are multiplied together, and the result is used 
to adjust the gain or attenuation of the color dif- 


ference signals. The intent here is to minimize 
the amount of circuitry in the color difference 
signal path. The “burst level error” signal is 
used in the event the burst (and thus the mod- 
ulated chrominance information) is not at the 
correct amplitude and adjusts the saturation of 
the color difference signals appropriately. 

For more information on the “burst level 
error” signal, please see the Color Killer sec- 
tion. 


Automatic Flesh Tone Correction 

Flesh tone correction may be used in NTSC 
decoders since the eye is very sensitive to 
flesh tones, and the actual colors may become 
slightly corrupted during the broadcast pro- 
cess. If the grass is not quite the proper color 
of green, it is not noticeable; however, a flesh 
tone that has a green or orange tint is unac- 
ceptable. Since the flesh tones are located 
close to the +I axis, a typical flesh tone correc- 
tor looks for colors in a specific area (Figure 
9.28), and any colors within that area are made 
a color that is closer to the flesh tone. 

A simple flesh tone corrector may halve 
the Q value for all colors that have a corre- 
sponding +I value. However, this implementa- 
tion also changes nonflesh tone colors. A more 
sophisticated implementation is if the color has 
a value between 25% and 75% of full-scale, and 
is within +30° of the +I axis, then Q is halved. 
This moves any colors within the flesh tone 
region closer to “ideal” flesh tone. 

It should be noted that the phase angle for 
flesh tone varies between companies. Phase 
angles from 116° to 126° are used; however, 
using 123° (the +I axis) simplifies the process- 
ing. 
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Figure 9.28. Typical Flesh Tone Color Range. 


Color Killer 
If a color burst of 12.5% or less of ideal ampli- 
tude is detected for 128 consecutive scan lines, 
the color difference signals should be forced to 
zero. Once a color burst of 25% or more of ideal 
amplitude is detected for 128 consecutive scan 
lines, the color difference signals may again be 
enabled. This hysteresis prevents wandering 
back and forth between enabling and disabling 
the color information in the event the burst 
amplitude is borderline. 

The burst level may be determined by forc- 
ing all burst samples positive and sampling the 


result multiple times to determine an average 
value. This should be averaged over three scan 
lines to limit line-to-line variations. 

The “burst level error” is the ideal ampli- 
tude divided by the average result. If no burst 
is detected, this should be used to force the 
color difference signals to zero and to disable 
any filtering in the luminance path, allowing 
maximum resolution luminance to be output. 

Providing the ability to optionally force the 
color decoding on or off is useful in some appli- 
cations, such as video editing. 
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Color Space Conversion 


YUV or YIQ data is usually converted to 
YCbCr or R’G'B’ data before being output 
from the decoder. If converting to R’G’B’ data, 
the R’G’B’ data must be clipped at the 0 and 
1023 values to prevent wrap-around errors. 


(M) NTSC, (M, N) PAL 


YUV Color Space Processing 

Modern decoder designs are now based on the 
YUV color space, For these decoders, the YUV 
to YCbCr equations are: 


Yoo = 1.691Y + 64 


Cb = [1.984U cos 6p] + [1.984V sin Op] 
+512 


Cr = [1.406U cos 6p] + [1.406V sin Op] 
+512 


To generate R’G’B’ data with a range of 0- 
1023, the YUV to R’G’B’ equations are: 


R’ = 1.975Y + [2.251U cos Op] + 
[2.251V sin Og] 
G’ = 1.975Y - 0.779U - 1.146V 


B’ = 1.975Y + [4.013U cos 6p] 
+ [4.013V sin 6p] 


To generate R’G’B’ data with a nominal range 
of 64-940 for pro-video applications, the YUV 
to R’G’B’ equations are: 


R’ = 1.691Y + 1.928V + 64 


G’ = 1.691Y — 0.667U — 0.982V + 64 
B’ = 1.691Y + 3.436U + 64 


The ideal values for 0p and Op are 90° and 0°, 
respectively. However, for consumer televi- 
sions sold in the United States, 6p and Op usu- 
ally have values of 110° and 0°, respectively, or 


100° and —10°, respectively, to reduce the visi- 
bility of differential phase errors, at the cost of 
color accuracy. 


YIQ Color Space Processing 

For older NTSC decoder designs based on the 
YIQ color space, the YIQ to YCbCr equations 
are: 


Y6o1 = 1.692Y + 64 
Cb = -1.081I + 1.664Q + 512 
Cr = 1.1811 + 0.765Q + 512 


To generate R’G’B’ data with a range of 0- 
1023, the YIQ to R’G’B’ equations are: 

R’ = 1.975Y + 1.8871 + 1.224Q 

G’ = 1.975Y — 0.5361 — 1.278Q 

B’ =1.975Y — 2.1891 + 3.367Q 


To generate R’G’B’ data with a nominal range 
of 64-940 for pro-video applications, the YIQ to 
R’G’B’ equations are: 


R’ = 1.691Y + 1.6161 + 1.048Q + 64 
G’ = 1.691Y — 0.4591 - 1.094Q + 64 
B’ = 1.691Y - 1.8741 + 2.883Q + 64 


YCbCr Color Space Processing 

If the design is based on the YUV color space, 
the U and V conversion to Cb and Cr may be 
avoided by scaling the sin and cos values dur- 
ing the demodulation process, or scaling the 
color difference lowpass filter coefficients. 


NTSC-J 

Since the version of (M) NTSC used in Japan 
has a 0 IRE blanking pedestal, the color space 
conversion equations are slightly different 
than those for standard (M) NTSC. 
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YUV Color Space Processing 

Modern decoder designs are now based on the 
YUV color space, For these decoders, the YUV 
to YCbCr equations are: 


Y6o1 = 1.564Y + 64 
Cb = 1.835U + 512 


Cr = [1.301U cos 6p] + [1.301V sin 6p] 
+512 


To generate R’G’B’ data with a range of 0- 
1023, the YUV to R’G’B’ equations are: 


R’ = 1.827Y + [2.082U cos 0] + 
[2.082V sin Og] 

G’ = 1.827Y - 0.721U - 1.060V 

B’ = 1.827Y + 3.712U 


To generate R’G’B’ data with a nominal range 
of 64-940 for pro-video applications, the YUV 
to R’G’B’ equations are: 


R’ = 1.564Y + 1.783V + 64 
G’ = 1.564Y — 0.617U — 0.908V + 64 
B’ = 1.564Y + 3.179U + 64 


The ideal value for Op is 90°. However, for tele- 
visions sold in the Japan, Op usually has a value 
of 95° to reduce the visibility of differential 
phase errors, at the cost of color accuracy. 


YIQ Color Space Processing 

For older NTSC decoder designs based on the 
YIQ color space, the YIQ to YCbCr equations 
are: 


Yoo = 1.565Y + 64 
Cb = -1.000I + 1.539Q + 512 
Cr = 1.0901 + 0.708Q + 512 


To generate R’G’B’ data with a range of 0- 
1023, the YIQ to R’G’B’ equations are: 

R’ = 1.827Y + 1.7461 + 1.132Q 

G’ = 1.827Y — 0.4961 — 1.182Q 

B’ =1.827Y — 2.0241 + 3.115Q 


To generate R’G’B’ data with a nominal range 
of 64-940 for pro-video applications, the YIQ to 
R’G’B’ equations are: 


R’ = 1.564Y + 1.4951 + 0.970Q + 64 
G’ = 1.564Y - 0.4251 - 1.012Q + 64 
B’ = 1.564Y - 1.7341 + 2.667Q + 64 


YCbCr Color Space Processing 

If the design is based on the YUV color space, 
the U and V conversion to Cb and Cr may be 
avoided by scaling the sin and cos values dur- 
ing the demodulation process, or scaling the 
color difference lowpass filter coefficients. 


(B, D, G, H, I, Nc) PAL 


YUV Color Space Processing 
The YUV to YCbCr equations are: 


Y6o1 = 1.599Y + 64 
Cb = 1.875U + 512 
Cr = 1.329V + 512 


To generate R’G’B’ data with a range of 0- 
1023, the YUV to R’G’B’ equations are: 

R’ = 1.867Y + 2.128V 

G’ = 1.867Y - 0.737U — 1.084V 

B’ = 1.867Y + 3.793U 
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To generate R’G’B’ data with a nominal range 
of 64-940 for pro-video applications, the YUV 
to R’G’B’ equations are: 

R’ = 1.599Y + 1.822V + 64 

G’ = 1.599Y — 0.631U — 0.928V + 64 

B’ = 1.599Y + 3.248U + 64 


YCbCr Color Space Processing 

The U and V conversion to Cb and Cr may be 
avoided by scaling the sin and cos values dur- 
ing the demodulation process, or scaling the 
color difference lowpass filter coefficients. 


Genlocking 


The purpose of the genlock circuitry is to 
recover a sample clock and the timing control 
signals (such as horizontal sync, vertical 
sync, and the color subcarrier) from the video 
signal. Since the original sample clock is not 
available, it is usually generated by multiply- 
ing the horizontal line frequency, Fy, by the 
desired number of samples per line, using a 
phase-lock loop (PLL). Also, the color subcar- 
rier must be regenerated and locked to the 
color subcarrier of the video signal being 
decoded. 

There are, however, several problems. 
Video signals may contain noise, making the 
determination of sync edges unreliable. The 
amount of time between horizontal sync edges 
may vary slightly each line, particularly in ana- 
log video tape recorders (VCRs) due to 
mechanical limitations. For analog VCRs, 
instantaneous line-to-line variations are up to 
+100 ns; line variations between the beginning 
and end of a field are up to +5 us. When analog 
VCRs are in a “special feature” mode, such as 
fast-for warding or still-picture, the amount of 


time between horizontal sync signals may vary 
up to +20% from nominal. 

Vertical sync, as well as horizontal sync, 
information must be recovered. Unfortu- 
nately, analog VCRs, in addition to destroying 
the SCH phase relationship, perform head 
switching at field boundaries, usually some- 
where between the end of active video and the 
start of vertical sync. When head switching 
occurs, one video signal (field n) is replaced by 
another video signal (field n + 1) which has an 
unknown time offset from the first video sig- 
nal. There may be up to a+1/2 line variation in 
vertical timing each field. As a result, longer- 
than-normal horizontal or vertical syncs may 
be generated. 

By monitoring the horizontal line timing, it 
is possible to automatically determine whether 
the video source is in the “normal” or “special 
feature” mode. During “normal” operation, the 
horizontal line time typically varies by no more 
than +5 us over an entire field. Line timing out- 
side this +5 us window may be used to enable 
“special feature” mode timing. Hysteresis 
should be used in the detection algorithm to 
prevent wandering back and forth between the 
“normal” and “special feature” operations in 
the event the video timing is borderline 
between the two modes. A typical circuit for 
performing the horizontal and vertical sync 
detection is shown in Figure 9.29. 

In the absence of a video signal, the 
decoder should be designed to optionally free- 
run, continually generating the video timing to 
the system, without missing a beat. During the 
loss of an input signal, any automatic gain cir- 
cuits should be disabled and the decoder 
should provide the option to either be transpar- 
ent (so the input source can be monitored), to 
auto-freeze the output data (to compensate for 
short duration dropouts), or to autoblack the 
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Figure 9.29. Sync Detection and Phase Comparator Circuitry. 
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output data (to avoid potential problems driv- 
ing a mixer or VCR). 


Horizontal Syne Detection 

Early decoders typically used analog sync slic- 
ing techniques to determine the midpoint of 
the leading edge of the sync pulse and used a 
PLL to multiply the horizontal frequency rate 
up to the sample clock rate. However, the lack 
of accuracy of the analog sync slicer, combined 
with the limited stability of the PLL, resulted in 
sample clock jitter and noise amplification. 
When using comb filters for Y/C separation, 
the long delay between writing and reading the 
video data means that even a small sample 
clock frequency error results in a delay that is 
a significant percentage of the subcarrier 
period, negating the effectiveness of the comb 
filter. 


Coarse Horizontal Sync Locking 

The coarse sync locking enables a faster lock- 
up time to be achieved. Digitized video is low- 
pass filtered to about 0.5 MHz to remove high- 
frequency information, such as noise and color 
subcarrier information. Performing the sync 
detection on lowpass filtered data also provides 
edge shaping in the event that fast sync edges 
(rise and fall times less than one clock cycle) 
are present. 

An 11-bit horizontal counter is incre- 
mented each sample clock cycle, resetting to 
001y after counting up to the HCOUNT value, 
where HCOUNT specifies the total number of 
samples per line. A value of 001y indicates that 
the beginning of a horizontal sync is expected. 
When the horizontal counter value is 
(HCOUNT - 64), a sync gate is enabled, allow- 
ing recovered sync information to be detected. 


Up to five consecutive missing sync pulses 
should be detected before any correction to 
the clock frequency or other adjustments are 
done. Once sync information has been 
detected, the sync gate is disabled until the 
next time the horizontal counter value is 
(HCOUNT -— 64). This helps filter out noise, 
serration, and equalization pulses. If the lead- 
ing edge of recovered horizontal sync is not 
within +64 clock cycles (approximately +5 ts) 
of where it is expected to be, the horizontal 
counter is reset to 001y to realign the edges 
more closely. 

Additional circuitry may be included to 
monitor the width of the recovered horizontal 
sync pulse. If the horizontal sync pulse is not 
approximately the correct pulse width, ignore 
it and treat it as a missing sync pulse. 

If the leading edge of recovered horizontal 
sync is within +64 sample clock cycles (approx- 
imately +5 us) of where it is expected to be, the 
fine horizontal sync locking circuitry is used to 
fine-tune the timing. 


Fine Horizontal Sync Locking 

One-half the sync amplitude is subtracted from 
the 0.5-MHz lowpass-filtered video data so the 
sync timing reference point (50% sync ampli- 
tude) is at zero. 

The leading horizontal sync edge may be 
determined by summing a series of weighted 
samples from the region of the sync edge. To 
perform the filtering, the weighting factors are 
read from a ROM by a counter triggered by 
the horizontal counter. When the central 
weighting factor (AO) is coincident with the 
50% amplitude point of the leading edge of 
sync, the result integrates to zero. Typical 
weighting factors are: 
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AO = 102/4096 
Al = 90/4096 
A2 = 63/4096 
A3 = 34/4096 
A4 = 14/4096 
AS = 5/4096 
A6 = 2/4096 


This arrangement uses more of the timing 
information from the sync edge and sup- 
presses noise. Note that circuitry should be 
included to avoid processing the trailing edge 
of horizontal sync. 

Figure 9.30 shows the operation of the fine 
sync phase comparator. Figure 9.30a shows 
the leading sync edge for NTSC. Figure 9.30b 
shows the weighting factors being generated, 
and when multiplied by the sync information, 
produces the waveform shown in Figure 9.30c. 
When the AO coefficient is coincident with the 
50% amplitude point of sync, the waveform 
integrates to zero. Distortion of sync edges, 
resulting in the locking point being slightly 
shifted, is minimized by the lowpass filtering, 
effectively shaping the sync edges prior to pro- 
cessing. 


Sample Clock Generation 

The horizontal sync phase error signal from 
Figure 9.29 is used to adjust the frequency ofa 
line-locked PLL, as shown in Figure 9.31. A 
line-locked PLL always generates a constant 
number of clock cycles per line, regardless of 
any line time variations. The free-running fre- 
quency of the PLL should be the nominal sam- 
ple clock frequency required (for example, 
13.5 MHz). 

Using a VCO-based PLL has the advantage 
of a wider range of sample clock frequency 
adjustments, useful for handling video timing 
variations outside the normal video specifica- 
tions. A disadvantage is that, due to jitter in the 


sample clock, there may be visible hue arti- 
facts and poor Y/C separation. 

A VCXO-based PLL has the advantage of 
minimal sample clock jitter. However, the sam- 
ple clock frequency range may be adjusted 
only a small amount, limiting the ability of the 
decoder to handle nonstandard video timing. 

Ideally, with either design, the rising edge 
of the sample clock is aligned with the half 
amplitude point of the leading edge of horizon- 
tal sync, and a fixed number of sample clock 
cycles per line (HCOUNT) are always gener- 
ated. 

An alternate method is to asynchronously 
sample the video signal with a fixed-frequency 
clock (for example, 13.5 MHz). Since in this 
case the sample clock is not aligned with hori- 
zontal sync, there is a phase difference 
between the actual sample position and the 
ideal sample position. As with the conventional 
genlock solution, this phase difference is 
determined by the difference between the 
recovered and expected horizontal syncs. 

The ideal sample position is defined to be 
aligned with a sample clock generated by a 
line-locked PLL. Rather than controlling the 
sample clock frequency, the horizontal sync 
phase error signal is used to control interpola- 
tion between two samples of data to generate 
the ideal sample value. If using comb filtering 
for Y/C separation, the digitized composite 
video may be interpolated to generate the ideal 
sample points, providing better Y/C separation 
by aligning the samples more precisely. 


Vertical Sync Detection 

Digitized video is lowpass filtered to about 0.5 
MHz to remove high-frequency information, 
such as noise and color subcarrier informa- 
tion. The 10-bit vertical counter is incremented 
by each expected horizontal sync, resetting to 
001, after counting up to 525 or 625. A value of 
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Figure 9.30. Fine Lock Phase Comparator Waveforms. 
(a) The NTSC sync leading edge. (b) The series of 
weighting factors. (c) The weighted leading edge 
samples. 
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Figure 9.31. Typical Line-Locked Sample Clock Generation. 


001y indicates that the beginning of a vertical 
sync for Field 1 is expected. 

The end of vertical sync intervals is 
detected and used to set the value of the verti- 
cal counter according to the mode of opera- 
tion. By monitoring the relationship of 
recovered vertical and horizontal syncs, Field 
1 vs. Field 2 information is detected. If a recov- 
ered horizontal sync occurs more than 64, but 
less than (HCOUNT/2), clock cycles after 
expected horizontal sync, the vertical counter 
is not adjusted to avoid double incrementing 
the vertical counter. If a recovered horizontal 
syne occurs (HCOUNT/2) or more clock 
cycles after the vertical counter has been 
incremented, the vertical counter is again 
incremented. 

During “special feature” operation, there is 
no longer any correlation between the vertical 
and horizontal timing information, so Field 1 
vs. Field 2 detection cannot be done. Thus, 
every other detection of the end of vertical 
syne should set the vertical counter accord- 
ingly in order to synthesize Field 1 and Field 2 
timing. 


Subcarrier Generation 
As with the encoder, the color subcarrier is 
generated from the sample clock using a DTO 
(Figure 9.32), and the same frequency relation- 
ships apply as those discussed in the encoder 
section. 

Unlike the encoder, the phase of the gener- 
ated subcarrier must be continuously adjusted 


to match that of the video signal being 
decoded. 

The subcarrier locking circuitry phase 
compares the generated subcarrier and the 
incoming subcarrier, resulting in an Foc error 
signal indicating the amount of phase error. 
This Foc error signal is added to the [p] value 
to continually adjust the step size of the DTO, 
adjusting the phase of the generated subcar- 
rier to match that of the video signal being 
decoded. 

As a 22-bit single-stage DTO is used to 
divide down the sample clock to generate the 
subcarrier in Figure 9.32, the [p] value is 
determined as follows: 


Foc¢/Fs = (P/4194303) = (P/ (27? - 1)) 


where Fcc = the desired subcarrier frequency 
and Fs = the sample clock rate. Some values of 
[p] for popular sample clock rates are shown in 
Table 9.10. 


Subcarrier Locking 

The purpose of the subcarrier locking cir- 
cuitry (Figure 9.33) is to phase lock the gener- 
ated color subcarrier to the color subcarrier of 
the video signal being decoded. 

Digital composite video (or digital chromi- 
nance video) has the blanking level subtracted 
from it. It is also gated with a “burst gate” to 
ensure that the data has a value of zero outside 
the burst time. The burst gate signal should be 
timed to eliminate the edges of the burst, 
which may have transient distortions that will 
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Figure 9.32. Chrominance Subcarrier Generator. 
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Typical Total Samples 
A aii per Scan Line P 
PP (HCOUNT) 
13.5 MHz (M) NTSC 858 1,112,126 
13.5 MHz (B, D, G, H, I) PAL 864 1,377,477 
12.27 MHz (M) NTSC 780 1,223,338 
14.75 MHz (B, D, G, H, I) PAL 944 1,260,742 


Table 9.10. Typical HCOUNT and P Values for the 1-Stage 22-Bit DTO in 
Figure 9.32. 
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Figure 9.33. Subcarrier Phase Comparator Circuitry. 
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reduce the accuracy of the phase measure- 
ment. 

The color burst data is phase compared to 
the locally generated burst. Note that the sign 
information also must be compared so lock will 
not occur on 180° out-of-phase signals. The 
burst accumulator averages the 16 samples, 
and the accumulated values from two adjacent 
lines are averaged to produce the error signal. 
When the local subcarrier is correctly phased, 
the accumulated values from alternate lines 
cancel, and the phase error signal is zero. The 
error signal is sampled at the line rate and pro- 
cessed by the loop filter, which should be 
designed to achieve a lock-up time of about ten 
lines (50 or more lines may be required for 
noisy video signals). It is desirable to avoid 
updating the error signal during vertical inter- 
vals due to the lack of burst. The resulting Foc 
error signal is used to adjust the DTO that gen- 
erates the local subcarrier (Figure 9.32). 

During PAL operation, the phase detector 
also recovers the PAL Switch information used 
in generating the switched V subcarrier. The 
PAL Switch D flip-flop is synchronized to the 
incoming signal by comparing the local switch 
sense with the sign of the accumulated burst 
values. If the sense is consistently incorrect for 
16 lines, then the flip-flop is reset. 

Note the subcarrier locking circuit should 
be able to handle short-term frequency varia- 
tions (over a few frames) of +200 Hz, long-term 
frequency variations of +500 Hz, and color 
burst amplitudes of 25-200% of normal with 
short-term amplitude variations (over a few 
frames) of up to 5%. The lock-up time of 10 
lines is desirable to accommodate video sig- 
nals that may have been incorrectly edited 
(i.e., not careful about the SCH phase relation- 
ship) or nonstandard video signals due to 
freeze-framing, special effects, and so on. The 
10 lines enable the subcarrier to be locked 


before the active video time, ensuring correct 
color representation at the beginning of the 
picture. 


Video Timing Generation 


HSYNC# (Horizontal Sync) Generation 

An 11-bit horizontal counter is incremented on 
each rising edge of the sample clock. The 
count is monitored to determine when to gen- 
erate the burst gate, HSYNC# output, horizon- 
tal blanking, etc. Typically, each time the 
counter is reset to 001y, the HSYNC# output is 
asserted. The exact timing of HSYNC# is 
dependent on the video interface used, as dis- 
cussed in Chapter 6. 


H (Horizontal Blanking) Generation 

A horizontal blanking signal, H, may be imple- 
mented to specify when the horizontal blank- 
ing interval occurs. The timing of H is 
dependent on the video interface used, as dis- 
cussed in Chapter 6. 

The horizontal blank timing may be user 
programmable by incorporating start and stop 
blank registers. The values of these registers 
are compared to the horizontal counter value, 
and used to assert and negate the H control 
signal. 


VSYNC# (Vertical Sync) Generation 

A 10-bit vertical counter is incremented on 
each rising edge of HSYNC#. Typically, each 
time the counter is reset to 001y, the VSYNC# 
output is asserted. The exact timing of 
VSYNC# is dependent on the video interface 
used, as discussed in Chapter 6. 


F (FIELD) Generation 
A field signal, F, may be implemented to spec- 
ify whether Field 1 or Field 2 is being decoded. 
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The exact timing of F is dependent on the 
video interface used, as discussed in Chapter 
6. 

In instances where the output of an analog 
VCR is being decoded, and the VCR is in a spe- 
cial effects mode (such as still or fast-forward), 
there is no longer enough timing information 
to determine Field 1 vs. Field 2 timing. Thus, 
the Field 1 and Field 2 timing as specified by 
the VSYNC#/HSYNC# relationship (or the F 
signal) should be synthesized and may not 
reflect the true field timing of the video signal 
being decoded. 


V (Vertical Blanking) Generation 

A vertical blanking signal, V, may be imple- 
mented to specify when the vertical blanking 
interval occurs. The exact timing of V is depen- 
dent on the video interface used, as discussed 
in Chapter 6. 

The vertical blank timing may be user pro- 
grammable by incorporating start and stop 
blank registers. The values of these registers 
are compared to the vertical counter value, and 
used to assert and negate the V control signal. 


BLANK# Generation 
The composite blanking signal, BLANK#, is 
the logical NOR of the H and V signals. 

While BLANK# is asserted, RGB data may 
be forced to be a value of 0. YCbCr data may be 
forced to an 8-bit value of 16 for Y and 128 for 
Cb and Cr. Alternately, the RGB or YCbCr data 
outputs may not be blanked, allowing vertical 
blanking interval (VBI) data, such as closed 
captioning, teletext, widescreen signalling and 
other information to be output. 


Field Identification 

Although the timing relationship between the 
horizontal sync (HSYNC#) and vertical sync 
(VSYNC#) signals, or the F signal, may be 


used to specify whether a Field 1 vs. Field 2 is 
being decoded, one or two additional signals 
may be used to specify which one of four or 
eight fields is being decoded, as shown in 
Table 9.7. We refer to these additional control 
signals as FIELD_0 and FIELD_1. 

FIELD_0 should change state at the begin- 
ning of VSYNC#, or coincident with F, during 
fields 1, 3, 5, and 7. FIELD_1 should change 
state at the beginning of VSYNC#, or coinci- 
dent with F, during fields 1 and 5. 


NTSC Field Identification 

The beginning of fields 1 and 3 may be deter- 
mined by monitoring the relationship of the 
subcarrier phase relative to sync. As shown in 
Figure 8.5, at the beginning of field 1, the sub- 
carrier phase is ideally 0° relative to sync; at 
the beginning of field 3, the subcarrier phase is 
ideally 180° relative to sync. 

In the real world, there is a tolerance in the 
SCH phase relationship. For example, 
although the ideal SCH phase relationship may 
be perfect at the source, transmitting the video 
signal over a coaxial cable may result in a shift 
of the SCH phase relationship due to cable 
characteristics. Thus, the ideal phase plus or 
minus a tolerance should be used. Although 
+40° (NTSC) or +20° (PAL) is specified as an 
acceptable tolerance by the video standards, 
many decoder designs use a tolerance of up to 
+80°. 

In the event that a SCH phase relationship 
not within the proper tolerance is detected, the 
decoder should proceed as if nothing were 
wrong. If the condition persists for several 
frames, indicating that the video source no 
longer may be a “stable” video source, opera- 
tion should change to that for an “unstable” 
video source. 

For “unstable” video sources that do not 
maintain the proper SCH relationship (such as 
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analog VCRs), synthesized FIELD_0 and 
FIELD_1 outputs should be generated (for 
example, by dividing the F output signal by two 
and four) in the event the signal is required for 
memory addressing or downstream process- 


ing. 


PAL Field Identification 

The beginning of fields 1 and 5 may be deter- 
mined by monitoring the relationship of the -U 
component of the extrapolated burst relative to 
sync. As shown in Figure 8.16, at the begin- 
ning of field 1, the phase is ideally 0° relative to 
sync; at the beginning of field 5, the phase is 
ideally 180° relative to sync. Either the burst 
blanking sequence or the subcarrier phase 
may be used to differentiate between fields 1 
and 3, fields 2 and 4, fields 5 and 7, and fields 6 
and 8. All of the considerations discussed for 
NTSC in the previous section also apply for 
PAL. 


Auto-Detection of Video Signal Type 


If the decoder can automatically detect the 
type of video signal being decoded, and config- 
ure itself automatically, the user will not have 
to guess at the type of video signal being pro- 
cessed. This information can be passed via sta- 
tus information to the rest of the system. 

If the decoder detects less than 575 lines 
per frame for at least 16 consecutive frames, 
the decoder can assume the video signal is 
(M) NTSC or (M) PAL. First, assume the 
video signal is (M) NTSC as that is much more 
popular. If the vertical and horizontal timing 
remain locked, but the decoder is unable to 
maintain subcarrier locking, the video signal 
may be (M) PAL. In that case, try (M) PAL 
operation and verify the burst timing. 

If the decoder detects more than 575 lines 
per frame for at least 16 consecutive frames, it 


can assume the video signal is (B, D, G, H, J, 
N, Nc) PAL or a version of SECAM. 

First, assume the video signal is (B, D, G, 
H, I, N) PAL. If the vertical and horizontal tim- 
ing remain locked, but the decoder is unable to 
maintain a subcarrier lock, it may mean the 
video signal is (Nc) PAL or SECAM. In that 
case, try SECAM operation (as that is much 
more popular), and if that doesn’t subcarrier 
lock, try (Nc) PAL operation. 

If the decoder detects a video signal format 
that it cannot lock to, this should be indicated 
so user can be notified. 

Note that auto-detection cannot be per- 
formed during “special feature” modes of ana- 
log VCRs, such as fast-forwarding. If the 
decoder detects a “special feature” mode of 
operation, it should disable the auto-detection 
circuitry. Auto-detection should only be done 
when a video signal has been detected after 
the loss of an input video signal. 


Y/C Separation Techniques 


The encoder typically combines the luminance 
and chrominance signals by simply adding 
them together; the result is that chrominance 
and high-frequency luminance signals occupy 
the same portion of the frequency spectrum. 
As a result, separating them in the decoder is 
difficult. When the signals are decoded, some 
luminance information is decoded as color 
information (referred to as cross-color), and 
some chrominance information remains in the 
luminance signal (referred to as cross-lumi- 
nance). Due to the stable performance of digi- 
tal decoders, much more complex separation 
techniques can be used than is possible with 
analog decoders. 

The presence of crosstalk is bad news in 
editing situations; crosstalk components from 
the first decoding are encoded, possibly caus- 
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ing new or additional artifacts when decoded 
the next time. In addition, when a still frame is 
captured from a decoded signal, the frozen 
residual subcarrier on edges may beat with the 
subcarrier of any following encoding process, 
resulting in edge flicker in colored areas. 
Although the crosstalk problem cannot be 
solved entirely at the decoder, more elaborate 
Y/C separation minimizes the problem. 

If the decoder is used in an editing envi- 
ronment, the suppression of cross-luminance 
and cross-chrominance is more important than 
the appearance of the decoded picture. When a 
picture is decoded, processed, encoded, and 
again decoded, cross-effects can introduce 
substantial artifacts. It may be better to limit 
the luminance bandwidth (to reduce cross- 
luminance), producing “softer” pictures. Also, 
limiting the chrominance bandwidth to less 
than 1 MHz reduces cross-color, at the 
expense of losing chrominance definition. 

Complementary Y/C_ separation pre- 
serves all of the input signal. If the separated 
chrominance and luminance signals are added 
together again, the original composite video 
signal is generated. 

Noncomplementary Y/C separation intro- 
duces some irretrievable loss, resulting in gaps 
in the frequency spectrum if the separated 
chrominance and luminance signals are again 
added together to generate a composite video 
signal. The loss is due to the use of narrower 
filters to reduce cross-color and cross-lumi- 
nance. Therefore, noncomplementary filtering 
is usually unsuitable when multiple encoding 
and decoding operations must be performed, 
as the frequency spectrum gaps continually 
increase as the number of decoding operations 
increase. It does, however, enable the “tweak- 
ing” of luminance and chrominance response 
for optimum viewing. 


Simple Y/C Separation 
With all of these implementations, there is no 
loss of vertical chrominance resolution, but 
there is also no suppression of cross-color. For 
PAL, line-to-line errors due to differential 
phase distortion are not suppressed, resulting 
in the vertical pattern known as Hanover bars. 
Noticeable artifacts of simple Y/C separa- 
tors are color artifacts on vertical edges. These 
include color ringing, color smearing, and the 
display of color rainbows in place of high-fre- 
quency gray-scale information. 


Lowpass and Highpass Filtering 

The most basic Y/C separator assumes fre- 
quencies below a certain point are luminance 
and above this point are chrominance. An 
example of this simple Y/C separator is shown 
in Figure 9.34. 

Frequencies below 3.0 MHz (NTSC) or 3.8 
MHz (PAL) are assumed to be luminance. Fre- 
quencies above these are assumed to be 
chrominance. Not only is high-frequency lumi- 
nance information lost, but it is assumed to be 
chrominance information, resulting in cross- 
color. 


Notch Filtering 
Although broadcast NTSC and PAL systems 


are strictly bandwidth-limited, this may not be 
true of other video sources. Luminance infor- 
mation may be present all the way out to 6 or 7 
MHz or even higher. For this reason, the 
designs in Figure 6.35 are usually more appro- 
priate, as they allow high-frequency luminance 
to pass, resulting in a sharper picture. 

Many designs based on the notch filter 
also incorporate comb filters in the Y and color 
difference data paths to reduce cross-color and 
cross-luma artifacts. However, the notch filter 
still limits the overall Y/C separation quality. 
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Figure 9.34. Typical Simple Y/C Separator. 
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Figure 9.35. Typical Simple Y/C Separator. (a) Complementary 
filtering. (b) Noncomplementary filtering. 
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PAL Considerations 

As mentioned before, PAL uses “normal” and 
“inverted” scan lines, referring to whether the 
V component is normal or inverted, to help 
correct color shifting effects due to differential 
phase distortions. 

For example, differential phase distortion 
may cause the green vector angle on “normal” 
scan lines to lag by 45° from the ideal 241° 
shown in Figure 8.11. This results in a vector at 
196°, effectively shifting the resulting color 
towards yellow. On “inverted” scan lines, the 
vector angle also will lag by 45° from the ideal 
120° shown in Figure 8.12. This results in a 
vector at 75°, effectively shifting the resulting 
color towards cyan. 


PAL Delay Line 
Figure 9.36, made by flipping Figure 8.12 180° 


about the U axis and overlaying the result onto 
Figure 8.11, illustrates the cancellation of the 
phase errors. The average phase of the two 
errors, 196° on “normal” scan lines and 286° 
on “inverted” scan lines, is 241°, which is the 
correct phase for green. For this reason, sim- 
ple PAL decoders usually use a delay line (or 
line store) to facilitate averaging between two 
scan lines. 

Using delay lines in PAL Y/C separators 
has unique problems. The subcarrier refer- 
ence changes by —90° (or 270°) over one line 
period, and the V subcarrier is inverted on 
alternate lines. Thus, there is a 270° phase dif- 
ference between the input and output of a line 
delay. If we want to do a simple addition or sub- 
traction between the input and output of the 
delay line to recover chrominance information, 
the phase difference must be 0° or 180°. And 
there is still that switching V floating around. 
Thus, we would like to find a way to align the 
subcarrier phases between lines and compen- 
sate for the switching V. 


Simple circuits, such as the noncomple- 
mentary Y/C separator shown in Figure 9.37, 
use a delay line that is not a whole line (283.75 
subcarrier periods), but rather 284 subcarrier 
periods. This small difference acts as a 90° 
phase shift at the subcarrier frequency. 

Since there are an integral number of sub- 
carrier periods in the delay, the U subcarriers 
at the input and output of the 284 Tsc delay 
line are in phase, and they can simply be added 
together to recover the U subcarrier. The V 
subcarriers are 180° out of phase at the input 
and output of the 284 Tsc delay line, due to the 
switching V, so the adder cancels them out. 
Any remaining high-frequency vertical V com- 
ponents are rejected by the U demodulator. 

Due to the switching V, subtracting the 
input and output of the 284 Toc delay line 
recovers the V subcarrier while cancelling the 
U subcarrier. Any remaining high-frequency 
vertical U components are rejected by the V 
demodulator. 

Since the phase shift through the 284 Tsc 
delay line is a function of frequency, the sub- 
carrier sidebands are not phase shifted exactly 
90°, resulting in hue errors on vertical chromi- 
nance transitions. Also, the chrominance and 
luminance are not vertically aligned since the 
chrominance is shifted down by one-half line. 


PAL Modifier 

Although the performance of the circuit in Fig- 
ure 9.37 usually is adequate, the 284 Tsc delay 
line may be replaced by a line delay followed 
by a PAL modifier, as shown in Figure 9.38. 
The PAL modifier provides a 90° phase shift 
and inversion of the V subcarrier. Chromi- 
nance from the PAL modifier is now in phase 
with the line delay input, allowing the two to be 
combined using a single adder and share a 
common path to the demodulators. The aver- 
aging sacrifices some vertical resolution, how- 
ever; Hanover bars are suppressed. 
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Figure 9.37. Single Delay Line PAL Y/C Separator. 


NTSC and PAL Digital Decoding 433 


COMPOSITE 


VIDEO 


BANDPASS FILTER 
3.1-5.7 MHZ 


PAL MODIFIER 


2 SIN 20T 


BANDPASS FILTER 
3.1-5.7 MHZ 


O- 0.5X 


} 


U SIN@T + VCOSaT & 


U SIN @T -/+ V COS oT 


—U SIN 3@T-/+ V COS 30T 


Figure 9.38. Single Line Delay PAL Y/C Separator Using a PAL Modifier. 


Since the chrominance at the demodulator 
input is in phase with the composite video, it 
can be used to cancel the chrominance in the 
composite signal to leave luminance. However, 
the chrominance and luminance are still not 
vertically aligned since the chrominance is 
shifted down by one-half line. 

The PAL modifier produces a luminance 
alias centered at twice the subcarrier fre- 
quency. Without the bandpass filter before the 
PAL modifier and the averaging between lines, 
mixing the original and aliased luminance com- 
ponents would result in a 12.5-Hz beat fre- 
quency, noticeable in high-contrast areas of the 
picture. 


2D Comb Filtering 

In the previous Y/C separators, high-fre- 
quency luminance information is treated as 
chrominance information; no attempt is made 
to differentiate between the two. As a result, 
the luminance information is interpreted as 
chrominance information (cross-color) and 


passed on to the chroma demodulator to 
recover color information. The demodulator 
cannot differentiate between chrominance and 
high-frequency luminance, so it generates 
color where color should not exist. Thus, occa- 
sional display artifacts are generated. 

2D (or intra-field) comb filtering attempts 
to improve the separation of chrominance and 
luminance at the expense of reduced vertical 
resolution. Comb filters get their name by hav- 
ing luminance and chrominance frequency 
responses that look like a comb. Ideally, these 
frequency responses would match the “comb- 
like” frequency responses of the interleaved 
luminance and chrominance signals shown in 
Figures 8.4 and 8.15. 

Modern comb filters typically use two line 
delays for storing the last two lines of video 
information (there is a one-line delay in decod- 
ing using this method). Using more than two 
line delays usually results in excessive vertical 
filtering, reducing vertical resolution. 
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Two Line Delay Comb Filters 

The BBC has done research (Reference 4) on 
various PAL comb filtering implementations 
(Figures 9.39 through 9.42). Each was evalu- 
ated for artifacts and frequency response. The 
vertical frequency response for each comb fil- 
ter is shown in Figure 9.43. 

In the comb filter design of Figure 9.39, 
the chrominance phase is inverted over two 
lines of delay. A subtracter cancels most of the 
luminance, leaving double-amplitude, verti- 
cally filtered chrominance. A PAL modifier pro- 
vides a 90° phase shift and removal of the PAL 
switch inversion to phase align the chromi- 
nance with the one line-delayed composite 
video signal. Subtracting the chrominance 
from the composite signal leaves luminance. 
This design has the advantage of vertical align- 
ment of the chrominance and luminance. How- 
ever, there is a loss of vertical resolution and 
no suppression of Hanover bars. In addition, it 
is possible under some circumstances to gen- 
erate double-amplitude luminance due to the 
aliased luminance components produced by 
the PAL modifier. 

The comb filter design of Figure 9.40 is 
similar to the one in Figure 9.39. However, the 
chrominance after the PAL modifier and one 
line-delayed composite video signal are added 
to generate double-amplitude chrominance 
(since the subcarriers are in phase). Again, 
subtracting the chrominance from the compos- 
ite signal leaves luminance. In this design, 
luminance over-ranging is avoided since both 
the true and aliased luminance signals are 
halved. There is less loss of vertical resolution 
and Hanover bars are suppressed, at the 
expense of increased cross-color. 

The comb filter design in Figure 9.41 has 
the advantage of not using a PAL modifier. 
Since the chrominance phase is inverted over 
two lines of delay, adding them together can- 
cels most of the chrominance, leaving double- 


amplitude luminance. This is subtracted from 
the one line-delayed composite video signal to 
generate chrominance. Chrominance is then 
subtracted from the one line-delayed compos- 
ite video signal to generate luminance (this is 
to maintain vertical luminance resolution). UV 
crosstalk is present as a 12.5-Hz flicker on hori- 
zontal chrominance edges, due to the chromi- 
nance signals not cancelling in the adder since 
the line-to-line subcarrier phases are not 
aligned. Since there is no PAL modifier, there 
is no luminance aliasing or luminance over- 
ranging. 

The comb filter design in Figure 9.42 is a 
combination of Figures 9.39 and 9.41. The 
chrominance phase is inverted over two lines 
of delay. An adder cancels most of the chromi- 
nance, leaving double-amplitude luminance. 
This is subtracted from the one line-delayed 
composite video signal to generate chromi- 
nance signal (A). In a parallel path, a sub- 
tracter cancels most of the luminance, leaving 
double-amplitude, vertically filtered chromi- 
nance. A PAL modifier provides a 90° phase 
shift and removal of the PAL switch inversion 
to phase align to the (A) chrominance signal. 
These are added together, generating double- 
amplitude chrominance. Chrominance then is 
subtracted from the one line-delayed compos- 
ite signal to generate luminance. The chromi- 
nance and luminance vertical frequency 
responses are the average of those for Figures 
9.39 and 9.41. UV crosstalk is similar to that for 
Figure 9.41, but has half the amplitude. The 
luminance alias is also half that of Figure 9.39, 
and Hanover bars are suppressed. 

From these comb filter designs, the BBC 
has derived designs optimized for general 
viewing (Figure 9.44) and standards conver- 
sion (Figure 9.45). 

For PAL applications, the best luminance 
processing (Figure 9.41) was combined with 
the optimum chrominance processing (Figure 
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Figure 9.39. Two Line Roe PAL Y/C Separator. 
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Figure 9.40. Two Line -6 dB Roe PAL Y/C Separator. 
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Figure 9.42. Two Line Weston PAL Y/C Separator. 
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Figure 9.43. Vertical Frequency Characteristics of the Comb Filters in Figures 9.39 
Through 9.42. 


438 Chapter 9: NTSC/PAL Digital Encoding and Decoding 


9.40). The difference between the two designs 
is the chrominance recovery. For standards 
conversion (Figure 9.45), the chrominance sig- 
nal is just the full-bandwidth composite video 
signal. Standards conversion uses vertical 
interpolation which tends to reduce moving 
and high vertical frequency components, 
including cross-luminance and_ cross-color. 
Thus, vertical chrominance resolution after 
processing usually will be better than that 
obtained from the circuits for general viewing. 
The circuit for general viewing (Figure 9.44) 


recovers chrominance with a goal of reducing 
cross-effects, at the expense of chrominance 
vertical resolution. 

For NTSC applications, the design of comb 
filters is easier. There are no switched subcar- 
riers to worry about, and the chrominance 
phases are a 180° per line, rather than 270°. In 
addition, there is greater separation between 
the luminance and chrominance frequency 
bands than in PAL, simplifying the separation 
requirements. 
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Figure 9.44. Two Line Delay PAL Y/C Separator Optimized for General Viewing. 
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Figure 9.45. Two Line Delay PAL Y/C Separator Optimized for Standards Conversion and Video 


Processing. 
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In Figures 9.46 and 9.47, the adder gener- 
ates a double-amplitude composite video signal 
since the subcarriers are in phase. There is a 
180° subcarrier phase difference between the 
output of the adder and the one line-delayed 
composite video signal, so subtracting the two 
cancels most of the luminance, leaving double 
amplitude chrominance. 

The main disadvantage of the design in 
Figure 9.46 is the unsuppressed cross-lumi- 
nance on vertical color transitions. However, 
this is offset by the increased luminance reso- 
lution over simple lowpass filtering. The rea- 
sons for processing chrominance in Figure 
9.47 are the same as for PAL in Figure 9.45. 


Adaptive Comb Filtering 

Conventional comb filters still have problems 
with diagonal lines and vertical color changes 
since only vertically-aligned samples are used 
for processing. 

With diagonal lines, after standard comb 
filtering, the chrominance information also 
includes the difference between adjacent lumi- 
nance values, which may also be interpreted as 
chrominance information. This shows up as 
cross-color artifacts, such as a rainbow appear- 
ance along the edge of the line. 

Sharp vertical color transitions generate 
the “hanging dot” pattern commonly seen on 
the scan line between the two color changes. 
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Figure 9.46. Two Line Delay NTSC Y/C Separator for General Viewing. 
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Figure 9.47. Two Line Delay NTSC Y/C Separator for Standards Conversion and Video Processing. 
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After standard comb filtering, the luminance 
information contains the color subcarrier. The 
amplitude of the color subcarrier is deter- 
mined by the difference between the two col- 
ors. Thus, different colors modulate the 
luminance intensity differently, creating a 
“dot” pattern on the scan line between two col- 
ors. To eliminate these “hanging dots,” a 
chroma trap filter is sometimes used after the 
comb filter. 

The adaptive comb filter attempts to solve 
these problems by processing a 3 x 3, 5 x 5, or 
larger, block of samples. The values of the sam- 
ples are used to determine which Y/C separa- 
tion algorithm to use for the center sample. As 
many as 32, or more, algorithms may be avail- 
able. By looking for sharp vertical transitions 
of luminance, or sharp color subcarrier phase 
changes, the operation of the comb filter is 
changed to avoid generating artifacts. 

Due to the cost of integrated line stores, 
the consumer market commonly uses 3-line 
adaptive comb filtering, with the next level of 
improvement being 3D motion adaptive comb 
filtering. 


3D Comb Filtering 
This method (also called inter-field Y/C sepa- 
ration) uses composite video data from the cur- 
rent field and from two fields (NTSC) or four 
fields (PAL) earlier. Adding the two cancels the 
chrominance (since it is 180° out of phase), 
leaving luminance. Subtracting the two cancels 
the luminance, leaving chrominance. For PAL, 
an adequate design may be obtained by replac- 
ing the line delays in Figure 9.42 with frame 
delays. 

This technique provides nearly-perfect Y/ 
C separation for stationary pictures. However, 
if there is any change between fields, the 
resulting Y/C separation is erroneous. For this 


reason, inter-field Y/C separators usually are 
not used, unless as part of a 3D motion adap- 
tive comb filter. 


3D Motion Adaptive Comb Filter 

A typical implementation that uses 3D (inter- 
field) comb filtering for still areas, and 2D 
(intra-field) comb filtering for areas of the pic- 
ture that contain motion, is shown in Figure 
9.48. The motion detector generates a value 
(K) of 0-1, allowing the luminance and chromi- 
nance signals from the two comb filters to be 
proportionally mixed. Hard switching between 
algorithms is usually visible. 

Figure 9.49 illustrates a simple motion 
detector block diagram. The concept is to com- 
pare frame-to-frame changes in the low fre- 
quency luminance signal. Its performance 
determines, to a large degree, the quality of 
the image. The motion signal (K) is usually 
rectified, smoothed by averaging horizontally 
and vertically over a few samples, multiplied 
by a gain factor, and clipped before being used. 
The only error the motion detector should 
make is to use the 2D comb filter on stationary 
areas of the image. 


Alpha Channel Support 


By incorporating an additional ADC within the 
NTSC/PAL decoder, an analog alpha signal 
(also called a key) may be digitized, and pipe- 
lined with the video data to maintain synchro- 
nization. This allows the designer to change 
decoders (which may have different pipeline 
delays) to fit specific applications without wor- 
rying about the alpha channel pipeline delay. 
Alpha is usually linear, with an analog range of 
0-100 IRE. There is no blanking pedestal or 
sync information present. 
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Figure 9.48. 3D Motion Adaptive Y/C Separator. 
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Figure 9.49. Simple Motion Detector Block Diagram for NTSC. 
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Decoder Video Parameters 


Many industry-standard video parameters 
have been defined to specify the relative qual- 
ity of NISC/PAL decoders. To measure these 
parameters, the output of the NTSC/PAL 
decoder (while decoding various video test sig- 
nals such as those described in Chapter 8) is 
monitored using video test equipment. Along 
with a description of several of these parame- 
ters, typical AC parameter values for both con- 
sumer and studio-quality decoders are shown 
in Table 9.11. 

Several AC parameters, such as short-time 
waveform distortion, group delay, and K fac- 
tors, are dependent on the quality of the analog 
video filters and are not discussed here. In 
addition to the AC parameters discussed in 
this section, there are several others that 
should be included in a decoder specification, 
such as burst capture and lock frequency 
range, and the bandwidths of the decoded YIQ 
or YUV video signals. 

There are also several DC parameters that 
should be specified, as shown in Table 9.12. 
Although genlock capabilities are not usually 
specified, except for “clock jitter,” we have 
attempted to generate a list of genlock parame- 
ters, shown in Table 9.13. 


Differential Phase 

Differential phase distortion, commonly 
referred to as differential phase, specifies how 
much the chrominance phase is affected by 
the luminance level—in other words, how 
much hue shift occurs when the luminance 
level changes. Both positive and negative 
phase errors may be present, so differential 
phase is expressed as a peak-to-peak measure- 
ment, expressed in degrees of subcarrier 
phase. 


This parameter is measured using a test 
signal of uniform-phase and amplitude chromi- 
nance superimposed on different luminance 
levels, such as the modulated ramp test signal, 
or the modulated five-step portion of the com- 
posite test signal. The differential phase 
parameter for a studio-quality decoder may 
approach 1° or less. 


Differential Gain 
Differential gain distortion, commonly 
referred to as differential gain, specifies how 
much the chrominance gain is affected by the 
luminance level—in other words, how much 
color saturation shift occurs when the lumi- 
nance level changes. Both attenuation and 
amplification may occur, so differential gain is 
expressed as the largest amplitude change 
between any two levels, expressed as a per- 
centage of the largest chrominance amplitude. 
This parameter is measured using a test 
signal of uniform phase and amplitude chromi- 
nance superimposed on different luminance 
levels, such as the modulated ramp test signal, 
or the modulated five-step portion of the com- 
posite test signal. The differential gain parame- 
ter for a studio-quality decoder may approach 
1% or less. 


Luminance Nonlinearity 
Luminance nonlinearity, also referred to as dif- 
ferential luminance and luminance nonlinear 
distortion, specifies how much the luminance 
gain is affected by the luminance level. In 
other words, there is a nonlinear relationship 
between the decoded luminance level and the 
ideal luminance level. 

Using an unmodulated five-step or ten-step 
staircase test signal, or the modulated five-step 
portion of the composite test signal, the differ- 
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Parameter pe ser Cael Units 
differential phase 4 <1 degrees 
differential gain 4 <1 % 
luminance nonlinearity 2 <1 % 
hue accuracy 3 <1 degrees 
color saturation accuracy 3 <1 % 
SNR (per EIA/TIA RS-250-C) 48 > 60 dB 
chrominance-to-luminance crosstalk <—40 <-50 dB 
luminance-to-chrominance crosstalk <—40 <-50 dB 
AH tilt <1 <1 % 

V tilt <1 <1 % 
Y/C sampling skew <5 <2 ns 
demodulation quadrature 90 +2 90 +0.5 degrees 


Table 9.11. Typical AC Video Parameters for NTSC and PAL Decoders. 


Parameter ee (B, rae HD) Units 
sync input amplitude 40 +20 43 +22 IRE 
burst input amplitude 40 +20 42.86 +22 IRE 
video input amplitude (1v nominal) 0.5 to 2.0 0.5 to 2.0 volts 


Table 9.12. Typical DC Video Parameters for NTSC and PAL Decoders. 
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Parameter Min Max Units 
sync locking time! 2 fields 
sync recovery time? 2 fields 
short-term sync lock range® +100 ns 
long-term sync lock range* +5 us 
number of consecutive missing horizontal 
sync pulses before any correction 5 sync pulses 
vertical correlation® +5 ns 
short-term subcarrier locking range® +200 Hz 
long-term subcarrier locking range” +500 Hz 
subcarrier locking time® 10 lines 
subcarrier accuracy +2 degrees 


Notes: 

1. Time from start of genlock process to vertical correlation specification is achieved. 

2. Time from loss of genlock to vertical correlation specification is achieved. 

3. Range over which vertical correlation specification is maintained. Short-term range 
assumes line time changes by amount indicated slowly between two consecutive lines. 

4. Range over which vertical correlation specification is maintained. Long-term range 
assumes line time changes by amount indicated slowly over one field. 

5. Indicates vertical sample accuracy. For a genlock system that uses a VCO or VCXO, this 
specification is the same as sample clock jitter. 

6. Range over which subcarrier locking time and accuracy specifications are maintained. 
Short-term time assumes subcarrier frequency changes by amount indicated slowly over 2 
frames. 

7. Range over which subcarrier locking time and accuracy specifications are maintained. 
Long-term time assumes subcarrier frequency changes by amount indicated slowly over 
24 hours. 

8. After instantaneous 180° phase shift of subcarrier, time to lock to within +2°. Subcarrier 
frequency is nominal +500 Hz. 


Table 9.13. Typical Genlock Parameters for NTSC and PAL Decoders. Parameters 
assume a video signal with > 30 dB SNR and over the range of DC parameters in 
Table 9.12. 
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ence between the largest and smallest steps, 
expressed as a percentage of the largest step, 
is used to specify the luminance nonlinearity. 
Although this parameter is included within the 
differential gain and phase parameters, it is tra- 
ditionally specified independently. 


Chrominance Nonlinear Phase Distortion 
Chrominance nonlinear phase distortion speci- 
fies how much the chrominance phase (hue) is 
affected by the chrominance amplitude (satu- 
ration)—in other words, how much hue shift 
occurs when the saturation changes. 

Using a modulated pedestal test signal, or 
the modulated pedestal portion of the combi- 
nation test signal, the decoder output for each 
chrominance packet is measured. The differ- 
ence between the largest and the smallest hue 
measurements is the peak-to-peak value. This 
parameter is usually not specified indepen- 
dently, but is included within the differential 
gain and phase parameters. 


Chrominance Nonlinear Gain Distortion 
Chrominance nonlinear gain distortion speci- 
fies how much the chrominance gain is 
affected by the chrominance amplitude (satu- 
ration). In other words, there is a nonlinear 
relationship between the decoded chromi- 
nance amplitude levels and the ideal chromi- 
nance amplitude levels—this is usually seen as 
an attenuation of highly saturated chromi- 
nance signals. 

Using a modulated pedestal test signal, or 
the modulated pedestal portion of the combi- 
nation test signal, the decoder is adjusted so 
that the middle chrominance packet (40 IRE) 
is decoded properly. The largest difference 
between the measured and nominal values of 
the amplitudes of the other two decoded 
chrominance packets specifies the chromi- 
nance nonlinear gain distortion, expressed in 
IRE or as a percentage of the nominal ampli- 


tude of the worst-case packet. This parameter 
is usually not specified independently, but is 
included within the differential gain and phase 
parameters. 


Chrominance-to-Luminance 
Intermodulation 

Chrominance-to-luminance intermodulation, 
commonly referred to as cross-modulation, 
specifies how much the luminance level is 
affected by the chrominance. This may be the 
result of clipping highly saturated chromi- 
nance levels or quadrature distortion and may 
show up as irregular brightness variations due 
to changes in color saturation. 

Using a modulated pedestal test signal, or 
the modulated pedestal portion of the combi- 
nation test signal, the largest difference 
between the decoded 50 IRE luminance level 
and the decoded luminance levels specifies the 
chrominance-to-luminance intermodulation, 
expressed in IRE or as a percentage. This 
parameter is usually not specified indepen- 
dently, but is included within the differential 
gain and phase parameters. 


Hue Accuracy 

Hue accuracy specifies how closely the 
decoded hue is to the ideal hue value. Both 
positive and negative phase errors may be 
present, so hue accuracy is the difference 
between the worst-case positive and worst-case 
negative measurements from nominal, 
expressed in degrees of subcarrier phase. This 
parameter is measured using EIA or EBU 75% 
color bars as a test signal. 


Color Saturation Accuracy 

Color saturation accuracy specifies how 
closely the decoded saturation is to the ideal 
saturation value, using EIA or EBU 75% color 
bars as a test signal. Both gain and attenuation 
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may be present, so color saturation accuracy is 
the difference between the worst-case gain and 
worst-case attenuation measurements from 
nominal, expressed as a percentage of nomi- 
nal. 


H Tilt 

H tilt, also known as line tilt and line time dis- 
tortion, causes a tilt in line-rate signals, pre- 
dominantly white bars. This type of distortion 
causes variations in brightness between the 
left and right edges of an image. For a digital 
decoder, H tilt is primarily an artifact of the 
analog input filters and the transmission 
medium. H tilt is measured using a line bar 
(such as the one in the NTC-7 NTSC compos- 
ite test signal) and measuring the peak-to-peak 
deviation of the tilt Gn IRE or percentage of 
white bar amplitude), ignoring the first and last 
microsecond of the white bar. 


V Tilt 

V tilt, also known as field tilt and field time dis- 
tortion, causes a tilt in field-rate signals, pre- 
dominantly white bars. This type of distortion 
causes variations in brightness between the 
top and bottom edges of an image. For a digital 
decoder, V tilt is primarily an artifact of the 
analog input filters and the transmission 
medium. V tilt is measured using an 181s, 100- 
IRE white bar in the center of 130 lines in the 
center of the field or using a field square wave. 
The peak-to-peak deviation of the tilt is mea- 
sured (in IRE or percentage of white bar ampli- 
tude), ignoring the first and last three lines. 
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H.261 and H.263 


There are several standards for video confer- 
encing, as shown in Table 10.1. Figures 10.1 
through 10.3 illustrate the block diagrams of 
several common video conferencing systems. 


H.261 


ITU-T H.261 was the first video compression 
and decompression standard developed for 
video conferencing. Originally designed for bit 
rates of p x 64 kbps, where p is in the range 1- 
30, H.261 is now the minimum requirement of 
all video conferencing standards, as shown in 
Table 10.1. 

A typical H.261 encoder block diagram is 
shown in Figure 10.4. The video encoder pro- 
vides a self-contained digital video bitstream 
which is multiplexed with other signals, such 
as control and audio. The video decoder per- 
forms the reverse process. 

H.261 video data uses the 4:2:0 YCbCr for- 
mat shown in Figure 3.7, with the primary 
specifications listed in Table 10.2. The maxi- 
mum picture rate may be restricted by having 
0, 1, 2, or 3 non-transmitted pictures between 
transmitted ones. 
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Two picture (or frame) types are sup- 
ported: 


Intra or I Frame: A frame having no reference 
frame for prediction. 


Inter or P Frame: A frame based on a previous 
frame. 


Coding Algorithm 


As shown in Figure 10.4, the basic functions 
are prediction, block transformation, and quan- 
tization. 

The prediction error (inter mode) or the 
input picture (intra mode) is subdivided into 8 
sample x 8 line blocks that are segmented as 
transmitted or non-transmitted. Four lumi- 
nance blocks and the two spatially correspond- 
ing color difference blocks are combined to 
form a 16 sample x 16 line macroblock as 
shown in Figure 10.5. 

The criteria for choice of mode and trans- 
mitting a block are not recommended and may 
be varied dynamically as part of the coding 
strategy. Transmitted blocks are transformed 
and the resulting coefficients quantized and 
variable-length coded. 
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H.310 H.320 H.324 H.322 H.323 H.324 | H.324/C 
Non-guaranteed 
Guaranteed : 
Broadband | Narrowband | 5 oadband | Bandwidth | Bandwidth 
Switched Packet PSTN . 
network ISDN Digital ISDN Packet Switched + POTS Mobile 
ATM LAN 8 ATMLAN | Switched COS 7 
ISDN Nebmocks Networks 
(Ethernet) 
eee MPEG 2 H.261 H.261 H.261 H.261 H.261 H.261 
H.261 1.263 1.263 1.263 H.263 1.263 1.263 
G.711 
Se ae G.711 G71 G71 G.722 
wiidiG. GARE nao. G.722 G72 G.722 G.723 G.723 G.723 
pied G.728 G.728 G.728 G.728 
G.729 
ae H.222.0 
multiplexing ane H.221 H.221 H.221 H.225.0 1.223 H.223A 
H.230 H.230 
control H.245 oe H.242 Hoe H.245 H.245 H.245 
niulapoint H.231 H.231 H.231 H.323 
data 1120 7120 7120 1.120 7120 T120 7120 
ae AAL 1363 AAL L363 1.400 
Se EeeeOns || AIM TSC 1.400 AJM 1.361 and TCP/IP ce a 
PHY 1.432 PHY 1400 | TCP/IP 


Table 10.1. Video Conferencing Family of Standards. 


450 


Chapter 10: H.261 and H.263 


H.320 


VIDEO CODEC 
(H.261, H.263) 


AUDIO CODEC 
(G.7XX) 


DATA INTERFACE 
(1.120) 


CONTROL 
(H.242, H.230) 


NETWORK 
INTERFACE 


(1.400 SERIES) 


Figure 10.1. Typical H.320 System. 


H.323 


VIDEO CODEC 
(H.261, H.263) 


AUDIO CODEC 
(G.7XX) 


DATA PROTOCOLS 
(7.120, LAPM, ETC.) 


CONTROL 


LAPM XID 
(H.245) PROCEDURES 


NETWORK 
INTERFACE 


TCP/IP 


Figure 10.2. Typical H.323 System. 
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H.324 


VIDEO CODEC 
(H.261, H.263) 


AUDIO CODEC 
(G.723) 


DATA PROTOCOLS 
(1.120, LAPM, ETC.) 


CONTROL 
(H.245) 


LAPM XID 
PROCEDURES 


MODEM 
Be (V.34 /V.8) 
MODEM 
CONTROL 
(V.25TER) 


Figure 10.3. Typical H.324 System. 


Prediction 

The prediction is inter-picture and may include 
motion compensation and a spatial filter. The 
coding mode using prediction is called inter; 
the coding mode using no prediction is called 
intra. 


Motion Compensation 

Motion compensation is optional in the 
encoder. The decoder must support the accep- 
tance of one motion vector per macroblock. 
Motion vectors are restricted—all samples ref- 
erenced by them must be within the coded pic- 
ture area. 

The horizontal and vertical components of 
motion vectors have integer values not exceed- 
ing +15. The motion vector is used for all four 
Y blocks in the macroblock. The motion vector 
for both the Cb and Cr blocks is derived by 
halving the values of the macroblock vector. 


A positive value of the horizontal or verti- 
cal component of the motion vector indicates 
that the prediction is formed from samples in 
the previous picture that are spatially to the 
right or below the samples being predicted. 


Loop Filter 

The prediction process may use a 2D spatial fil- 
ter that operates on samples within a predicted 
8 x 8 block. 

The filter is separated into horizontal and 
vertical functions. Both are non-recursive with 
coefficients of 0.25, 0.5, 0.25 except at block 
edges where one of the taps fall outside the 
block. In such cases, the filter coefficients are 
changed to 0, 1, 0. 

The filter is switched on or off for all six 
blocks in a macroblock according to the mac- 
roblock type. 
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INTRA / INTER 
FLAG 
CODING CONTROL 
YCBCR | SC TRANSMIT 
VIDEO IN FLAG 
QUANTIZER 
INDICATION 
~ 
switch --=| pcT --=) quantizeR [|-—g—_> QUANTIZING 
+ INDEX 
’ D) Z 
INVERSE 
QUANTIZER 
IDCT 
Ll 
re) See | 
SWITCH =) 
a 
taae PICTURE MEMORY 
— scuteg 4 [7 s(WITH MOTION COMPENSATED [-=——" 
VARIABLE DELAY MOTION 
VECTOR 
4 
_ LOOP FILTER 
ON / OFF 
Figure 10.4. Typical H.261 Encoder. 
Parameters CIF QCIF 
active resolution (Y) 352 x 288 176 x 144 
frame refresh rate 29.97 Hz 
YCbCr sampling structure 4:2:0 


form of YCbCr coding 


Uniformly quantized PCM, 8 bits per sample. 


Table 10.2. H.261 YCbCr Parameters. 
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DCT, IDCT 
Transmitted blocks are first processed by an 8 
x 8 DCT (discrete cosine transform). The out- 
put from the IDCT (inverse DCT) ranges from 
—256 to +255 after clipping, represented using 9 
bits. 

The procedures for computing the trans- 
forms are not defined, but the inverse trans- 
form must meet the specified error tolerance. 


Quantization 

Within a macroblock, the same quantizer is 
used for all coefficients, except the one for 
intra DC. The intra DC coefficient is usually 
linearly quantized with a step size of 8 and no 
dead zone. The other coefficients use one of 31 
possible linear quantizers, but with a central 
dead zone about zero and a step size of an even 
value in the range of 2-62. 


352 SAMPLES 


Clipping of Reconstructed Picture 
Clipping functions are used to prevent quanti- 
zation distortion of transform coefficient 
amplitudes, possibly causing arithmetic over- 
flows in the encoder and decoder loops. The 
clipping function is applied to the recon- 
structed picture, formed by summing the pre- 
diction and the prediction error. Clippers force 
sample values less than 0 to be 0 and values 
greater than 255 to be 255. 


Coding Control 

Although not included as part of H.261, several 
parameters may be varied to control the rate of 
coded video data. These include processing 
prior to coding, the quantizer, block signifi- 
cance criterion, and temporal subsampling. 
Temporal subsampling is performed by dis- 
carding complete pictures. 


GROUP OF BLOCKS 


BLOCK ARRANGEMENT 
WITHIN A MACROBLOCK 


CB DCT 5 
CR DCT 4 
DCT O DCT 1 
Y 
DCT 2 DCT 3 


MACROBLOCKS 


Figure 10.5. H.261 Arrangement of Group of Blocks, Macroblocks, and Blocks. 
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Forced Updating 

This is achieved by forcing the use of the intra 
mode of the coding algorithm. To control the 
accumulation of inverse transform mismatch 
errors, a macroblock should be _ forcibly 
updated at least once every 132 times it is 
transmitted. 


Video Bitstream 


Unless specified otherwise, the most signifi- 
cant bits are transmitted first. This is bit 1 and 
is the leftmost bit in the code tables. Unless 
specified otherwise, all unused or spare bits 
are set to “1.” 


The video bitstream is a hierarchical struc- 
ture with four layers. From top to bottom the 
layers are: 


Picture 

Group of Blocks (GOB) 
Macroblock (MB) 
Block 


Picture Layer 
Data for each picture consists of a picture 
header followed by data for group of blocks 
(GOBs). The structure is shown in Figure 
10.6. Picture headers for dropped pictures are 
not transmitted. 


GOB | GOB GOB 


GEI GSPARE GEI MB MB te MB 


PICTURE LAYER PSC TR PTYPE PEI PSPARE PEI 
GOB LAYER GBSC GN GQUANT 
MACROBLOCK LAYER MBA MTYPE MQUANT MVD CBP 
/ 
BLOCK LAYER TCOEFF 


BLOCK 1 BLOCK 6 


EOB 


Figure 10.6. H.261 Video Bitstream Layer Structures. 
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Picture Start Code (PSC) 
PSC is a 20-bit word with a value of 0000 0000 
0000 0001 0000. 


Temporal Reference (TR) 

TR is a 5-bit binary number representing 32 
possible values. It is generated by increment- 
ing the value in the previous picture header by 
one plus the number of non-transmitted pic- 
tures (at 29.97 Hz). The arithmetic is per- 
formed with only the five LSBs. 


Type Information (PTYPE) 
Six bits of information about the picture are: 


Bit1 Split screen indicator 
“Q” = off, <?? = on 

Bit2 Document camera indicator 
“Q” = off, a) ke = on 

Bit3 Freeze picture release 
“Q” = off, ? = on 

Bit4 Source format 
“Q” = QCIF oi” = CIF 

Bit5 Optional still image mode 
“Q” 2 on, oii = off 

Bit6 Spare 


Extra Insertion Information (PEI) 
PEI is a bit which when set to “1” indicates the 
presence of the following optional data field. 


Spare Information (PSPARE) 

If PEI is set to “1,” then these 9 bits follow con- 
sisting of 8 bits of data (PSPARE) and another 
PEI bit to indicate if a further 9 bits follow, and 
so on. 


Group of Blocks (GOB) Layer 

Each picture is divided into groups of blocks 
(GOB). A GOB comprises one-twelfth of the 
CIF picture area or one-third of the QCIF pic- 
ture area (see Figure 10.5). A GOB relates to 
176 samples x 48 lines of Y and the corre- 
sponding 88 x 24 array of Cb and Cr data. 

Data for each GOB consists of a GOB 
header followed by macroblock data, as shown 
in Figure 10.6. Each GOB header is transmit- 
ted once between picture start codes in the 
CIF or QCIF sequence numbered in Figure 
10.5, even if no macroblock data is present in 
that GOB. 


Group of Blocks Start Code (GBSC) 
GBSC is a 16-bit word with a value of 0000 0000 
0000 0001. 


Group Number (GN) 

GN is a 4-bit binary value indicating the posi- 
tion of the group of blocks. The bits are the 
binary representation of the number in Figure 
10.5. Numbers 13, 14, and 15 are reserved for 
future use. 


Quantizer Information (GQUANT) 

GQUANT is a 5-bit binary value that indicates 
the quantizer used for the group of blocks until 
overridden by any subsequent MQUANT. Val- 
ues of 1-31 are allowed. 


Extra Insertion Information (GEI) 

GEI is a bit which, when set to “1,” indicates 
the presence of the following optional data 
field. 


Spare Information (GSPARE) 

If GEI is set to “1,” then these 9 bits follow con- 
sisting of 8 bits of data (GSPARE) and then 
another GEI bit to indicate if a further 9 bits 
follow, and so on. 
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Macroblock (MB) Layer 
Each GOB is divided into 33 macroblocks as 
shown in Figure 10.5. A macroblock relates to 
16 samples x 16 lines of Y and the correspond- 
ing 8 x 8 array of Cb and Cr data. 

Data for a macroblock consists of a macro- 
block header followed by data for blocks (see 
Figure 10.6). 


Macroblock Address (MBA) 

MBA is a variable-length codeword indicating 
the position of a macroblock within a group of 
blocks. The transmission order is shown in 
Figure 10.5. For the first macroblock in a GOB, 
MBA is the absolute address in Figure 10.5. 
For subsequent macroblocks, MBA is the dif- 
ference between the absolute addresses of the 
macroblock and the last transmitted macro- 
block. The code table for MBA is given in 
Table 10.3. 

A codeword is available for bit stuffing 
immediately after a GOB header or a coded 
macroblock (called MBA stuffing). This code- 
word is discarded by decoders. 

The codeword for the start code is also 
shown in Table 10.3. MBA is always included 
in transmitted macroblocks. Macroblocks are 
not transmitted when they contain no informa- 
tion for that part of the picture. 


Type Information (MTYPE) 

MTYPE is a variable-length codeword contain- 
ing information about the macroblock and data 
elements that are present. Macroblock types, 
included elements, and variable-length code- 
words are listed in Table 10.4. MTYPE is 
always included in transmitted macroblocks. 


Quantizer (MQUANT) 

MQUANT is present only if indicated by 
MTYPE. It is a 5-bit codeword indicating the 
quantizer to use for this and any following 


blocks in the group of blocks, until overridden 
by any subsequent MQUANT. Codewords for 
MQUANT are the same as for GQUANT. 


Motion Vector Data (MVD) 

Motion vector data is included for all motion- 
compensated (MC) macroblocks, as indicated 
by MTYPE. MVD is obtained from the macro- 
block vector by subtracting the vector of the 
preceding macroblock. The vector of the previ- 
ous macroblock is regarded as zero for the fol- 
lowing situations: 


(a) Evaluating MVD for macroblocks 1, 
12, and 23. 


(b) Evaluating MVD for macroblocks 
where MBA does not represent a 
difference of 1. 


(c) MTYPE of the previous macroblock 
was not motion-compensated. 


Motion vector data consists of a variable- 
length codeword for the horizontal component, 
followed by a variable-length codeword for the 
vertical component. The variable-length codes 
are listed in Table 10.5. 


Coded Block Pattern (CBP) 

The variable-length CBP is present if indicated 
by MTYPE. It indicates which blocks in the 
macroblock have at least one transform coeffi- 
cient transmitted. The pattern number is rep- 
resented as: 


PoP PoP3P4P5 


where P,, = “1” for any coefficient present for 
block [n], else P, = “0.” Block numbering is 
given in Figure 10.5. 

The code words for the CBP number are 
given in Table 10.6. 
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MBA Code MBA Code 
1 1 17 0000 0101 10 
2 011 18 0000 0101 01 
3 010 19 0000 0101 00 
4 0011 20 0000 0100 11 
5 0010 21 0000 0100 10 
6 0001 1 22 0000 0100 011 
7 0001 0 23 0000 0100 010 
8 0000 111 24 0000 0100 001 
9 0000 110 25 0000 0100 000 
10 0000 1011 26 0000 0011 111 
11 0000 1010 27 0000 0011 110 
12 0000 1001 28 0000 0011 101 
13 0000 1000 29 0000 0011 100 
14 0000 0111 30 0000 0011 011 
15 0000 0110 31 0000 0011 010 
16 0000 0101 11 32 0000 0011 001 
33 0000 0011 000 
MBA stuffing 0000 0001 111 
start code 0000 0000 0000 0001 
Table 10.3. H.261 Variable-Length Code Table for MBA. 
Prediction MQUANT MVD CBP TCOEFF Code 
intra x 0001 
intra x x 0000 001 
inter x 1 
inter x x 0000 1 
inter + MC x 0000 0000 1 
inter + MC x x 0000 0001 
inter + MC x x 0000 0000 01 
inter + MC + FIL x 001 
inter + MC + FIL x 01 
inter + MC + FIL x x 0000 01 


Table 10.4. H.261 Variable-Length Code Table for MTYPE. 
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Block Layer 
A macroblock is made up of four Y blocks, a 
Cb block, and a Cr block (see Figure 10.5). 
Data for an 8 sample x 8 line block consists 
of codewords for the transform coefficients fol- 
lowed by an end of block (EOB) marker as 
shown in Figure 10.6. The order of block trans- 
mission is shown in Figure 10.5. 


Transform Coefficients (TCOEFF) 

When MTYPE indicates intra, transform coef- 
ficient data is present for all six blocks in a 
macroblock. Otherwise, MTYPE and CBP sig- 
nal which blocks have coefficient data trans- 
mitted for them. The quantized DCT 
coefficients are transmitted in the order shown 
in Figure 7.50. 


Vector Difference Code Vector Difference Code 
-16& 16 0000 0011 001 1 010 
-15&17 0000 0011 011 2 & -30 0010 
-14&18 0000 0011 101 3 & -29 0001 0 
-13 &19 0000 0011 111 4 & -28 0000 110 
-12 & 20 0000 0100 001 5 & -27 0000 1010 
-11&21 0000 0100 011 6 & -26 0000 1000 
-10 & 22 0000 0100 11 7&-25 0000 0110 
-9 & 23 0000 0101 01 8 & -24 0000 0101 10 
-8 & 24 0000 0101 11 9 & -23 0000 0101 00 
-7&25 0000 0111 10 & -22 0000 0100 10 
-6 & 26 0000 1001 11 &-21 0000 0100 010 
-5 & 27 0000 1011 12 & -20 0000 0100 000 
-4 & 28 0000 111 13 &-19 0000 0011 110 
-3 & 29 0001 1 14 &-18 0000 0011 100 
-2 & 30 0011 15 &-17 0000 0011 010 
-1 011 
0 1 


Table 10.5. H.261 Variable-Length Code Table for MVD. 
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CBP Code CBP Code 
60 111 62 0100 0 
4 1101 24 0011 11 
8 1100 36 0011 10 
16 1011 3 0011 01 
32 1010 63 0011 00 
12 1001 1 5 0010 111 
48 1001 0 9 0010 110 
20 1000 1 17 0010 101 
40 1000 0 33 0010 100 
28 0111 1 6 0010 011 
44 0111 0 10 0010 010 
52 0110 1 18 0010 001 
56 0110 0 34 0010 000 
1 0101 1 7 0001 1111 
61 0101 0 11 0001 1110 
2 0100 1 19 0001 1101 


Table 10.6a. H.261 Variable-Length Code Table for CBP. 


CBP Code CBP Code 

35 0001 1100 38 0000 1100 

13 0001 1011 29 0000 1011 

49 0001 1010 45 0000 1010 

21 0001 1001 53 0000 1001 

41 0001 1000 57 0000 1000 

14 0001 0111 30 0000 0111 

50 0001 0110 46 0000 0110 

22 0001 0101 54 0000 0101 

42 0001 0100 58 0000 0100 

15 0001 0011 31 0000 0011 1 
51 0001 0010 47 0000 0011 0 
23 0001 0001 55 0000 0010 1 
43 0001 0000 59 0000 0010 0 
25 0000 1111 27 0000 0001 1 
37 0000 1110 39 0000 0001 0 
26 0000 1101 


Table 10.6b. H.261 Variable-Length Code Table for CBP. 
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INTRA (|) FRAME 


[] PREDICTED (P) FRAME 


FH 


ql 


Figure 10.7. Typical H.261 Decoded Sequence. 


The most common combinations of suc- 
cessive zeros (RUN) and the following value 
(LEVEL) are encoded using variable-length 
codes, listed in Table 10.7. Since CBP indicates 
blocks with no coefficient data, EOB cannot 
occur as the first coefficient. The last bit “s” 
denotes the sign of the level: “0” = positive, “1” 
= negative. 

Other combinations of (RUN, LEVEL) are 
encoded using a 20-bit word: 6 bits of escape 
(ESC), 6 bits of RUN, and 8 bits of LEVEL, as 
shown in Table 10.8. 

Two code tables are used for the variable- 
length coding: one is used for the first trans- 
mitted LEVEL in inter, inter + MC, and inter + 
MC + FIL blocks; another is used for all other 
LEVELs, except for the first one in intra 
blocks, which is fixed-length coded with 8 bits. 

All coefficients, except for intra DC, have 
reconstruction levels (REC) in the range —2048 
to 2047. Reconstruction levels are recovered 
by the following equations, and the results are 
clipped. QUANT ranges from 1 to 31 and is 
transmitted by either GQUANT or MQUANT. 


QUANT = odd: 
for LEVEL > 0 
REC = QUANT x (2 x LEVEL + 1) 
for LEVEL < 0 
REC = QUANT x (2 x LEVEL - 1) 


QUANT = even: 
for LEVEL > 0 
REC = (QUANT x (2 x LEVEL + 1)) - 1 
for LEVEL < 0 
REC = (QUANT x (2 x LEVEL - 1)) +1 
for LEVEL = 0 
REC = 0 


For intra DC blocks, the first coefficient is 
typically the transform value quantized with a 
step size of 8 and no dead zone, resulting in an 
8-bit coded value, m. Black has a coded value of 
0001 0000 (16), and white has a coded value of 
1110 1011 (235). A transform value of 1024 is 
coded as 1111 1111. Coded values of 0000 0000 
and 1000 0000 are not used. The decoded value 
is 8n, except an m value of 255 results in a 
reconstructed transform value of 1024. 


H.261 
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Run Level Code 

EOB 10 
0 1 Is if first coefficient in block* 
0 1 lls if not first coefficient in block 
(0) 2 0100 s 
0 3 0010 Is 
0 4 0000 110s 
(0) 5 0010 0110 s 
(0) 6 0010 0001 s 
(0) 7 0000 0010 10s 
0 8 0000 0001 1101 s 
0) 9 0000 0001 1000 s 
0) 10 0000 0001 0011 s 
0) 11 0000 0001 0000 s 
(0) 12 0000 0000 1101 0s 
0 13 0000 0000 1100 Is 
(0) 14 0000 0000 1100 0s 
0 15 0000 0000 1011 Is 
1 1 Olls 
1 2 0001 10s 
1 3 0010 0101 s 
1 4 0000 0011 00s 
1 5 0000 0001 1011 s 
1 6 0000 0000 1011 0s 
1 7 0000 0000 1010 Is 
2 1 0101 s 
2 2 0000 100s 
2 3 0000 0010 11s 
2 4 0000 0001 0100 s 
2 5 0000 0000 1010 0s 
3 1 0011 Is 
3 2 0010 0100 s 
3 3 0000 0001 1100 s 
3 4 0000 0000 1001 Is 


Table 10.7a. H.261 Variable-Length Code Table for TCOEFF. 


*Never used in intra macroblocks. 
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Run Level Code 
4 1 0011 Os 
4 2 0000 0011 lls 
4 3 0000 0001 0010 s 
5 1 0001 lls 
5 2 0000 0010 Ols 
5 3 0000 0000 1001 Os 
6 1 0001 Ols 
6 2 0000 0001 1110 s 
7 1 0001 00s 
7 2 0000 0001 0101 s 
8 1 0000 111s 
8 2 0000 0001 0001 s 
9 1 0000 101s 
9 2 0000 0000 1000 Is 
10 1 0010 0111 s 
10 2 0000 0000 1000 Os 
11 1 0010 0011 s 
12 1 0010 0010 s 
13 1 0010 0000 s 
14 1 0000 0011 10s 
15 1 0000 0011 Ol1s 
16 1 0000 0010 00s 
17 1 0000 0001 1111 s 
18 1 0000 0001 1010 Ss 
19 1 0000 0001 1001 s 
20 1 0000 0001 0111 s 
21 1 0000 0001 0110 s 
22 1 0000 0000 1111 Is 
23 1 0000 0000 1111 Os 
24 1 0000 0000 1110 Is 
25 1 0000 0000 1110 Os 
26 1 0000 0000 1101 Is 
ESC 0000 01 


Table 10.7b. H.261 Variable-Length Code Table for TCOEFF. 
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Run Code 
0 0000 00 
1 0000 01 

63 1111 11 


Level Code 
-128 forbidden 
-127 1000 0001 
-2 1111 1110 
-l 1111 1111 
0) forbidden 
1 0000 0001 
2 0000 0010 
127 0111 1111 


Table 10.8. H.261 Run, Level Codes. 


Still Image Transmission 


H.261 allows the transmission of a still image 
of four times the resolution of the currently 
selected video format. If the video format is 
QCIF, a still image of CIF resolution may be 
transmitted; if the video format is CIF a still 
image of 704 x 576 resolution may be transmit- 
ted. 


H.263 


ITU-T H.263 improves on H.261 by providing 
improved video quality at lower bit rates. 

The video encoder provides a self-con- 
tained digital bitstream which is combined 
with other signals (such as H.223). The video 
decoder performs the reverse process. The 
primary specifications of H.263 regarding 
YCbCr video data are listed in Table 10.9. It is 
also possible to negotiate a custom picture 


size. The 4:2:0 YCbCr sampling is shown in 
Figure 3.7. 

With H.263 version 2 (formally known as 
H.263+), seven frame (or picture) types are 
now supported, with the first two being manda- 
tory (baseline H.263): 


Intra or I Frame: A frame having no reference 
frame for prediction. 


Inter or P Frame: A frame based on a previous 
frame. 


PB Frame and Improved PB Frame: A frame rep- 
resenting two frames and based on a previous 
frame. 


B Frame: A frame based two reference frames, 
one previous and one after wards. 


EI Frame: A frame having a temporally simulta- 
neous frame which has either the same or 
smaller frame size. 


EP Frame: A frame having a two reference 
frames, one previous and one simultaneous. 
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Coding Algorithm 


A typical encoder block diagram is shown in 
Figure 10.8. The basic functions are prediction, 
block transformation, and quantization. 

The prediction error or the input picture 
are subdivided into 8 x 8 blocks which are seg- 
mented as transmitted or non-transmitted. 
Four luminance blocks and the two spatially 
corresponding color difference blocks are 
combined to form a macroblock as shown in 
Figure 10.9. 

The criteria for choice of mode and trans- 
mitting a block are not recommended and may 
be varied dynamically as part of the coding 
strategy. Transmitted blocks are transformed 
and the resulting coefficients are quantized 
and variable-length coded. 


Prediction 
The prediction is interpicture and may include 
motion compensation. The coding mode using 
prediction is called inter; the coding mode 
using no prediction is called intra. 

Intra coding is signalled at the picture level 
(I frame for intra or P frame for inter) or at the 
macroblock level in P frames. In the optional 
PB frame mode, B frames always use the inter 
mode. 


Motion Compensation 
Motion compensation is optional in the en- 
coder. The decoder must support accepting 
one motion vector per macroblock (one or four 
motion vectors per macroblock in the optional 
advanced prediction or deblocking filter modes). 
In the optional PB frame mode, each mac- 
roblock may have an additional vector. In the 
optional improved PB frame mode, each mac- 
roblock can include an additional forward 
motion vector. In the optional B frame mode, 
macroblocks can be transmitted with both a 
forward and backward motion vector. 


For baseline H.263, motion vectors are 
restricted such that all samples referenced by 
them are within the coded picture area. Many 
of the optional modes remove this restriction. 
The horizontal and vertical components of 
motion vectors have integer or half-integer val- 
ues not exceeding —-16 to +15.5. Several of the 
optional modes increase the range to [-31.5, 
+31.5] or [-31.5, +30.5]. 

A positive value of the horizontal or verti- 
cal component of the motion vector typically 
indicates that the prediction is formed from 
samples in the previous frame which are spa- 
tially to the right or below the samples being 
predicted. However, for backward motion vec- 
tors in B frames, a positive value of the hori- 
zontal or vertical component of the motion 
vector indicates that the prediction is formed 
from samples in the next frame which are spa- 
tially to the left or above the samples being pre- 
dicted. 


Quantization 

The number of quantizers is 1 for the first intra 
coefficient and 31 for all other coefficients. 
Within a macroblock, the same quantizer is 
used for all coefficients except the first one of 
intra blocks. The first intra coefficient is usu- 
ally the transform DC value linearly quantized 
with a step size of 8 and no dead zone. Each of 
the other 31 quantizers are also linear, but with 
a central dead zone around zero and a step size 
of an even value in the range of 2-62. 


Coding Control 

Although not a part of H.263, several parame- 
ters may be varied to control the rate of coded 
video data. These include processing prior to 
coding, the quantizer, block significance crite- 
rion, and temporal subsampling. 
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Figure 10.8. Typical Baseline H.263 Encoder. 
Parameters 16CIF 4CIF CIF QCIF SQCIF 
active resolution (Y) 1408 x 1152 704 x 576 352 x 288 176 x 144 128 x 96 
frame refresh rate 29.97 Hz 
YCbCr sampling structure 4:2:0 
form of YCbCr coding Uniformly quantized PCM, 8 bits per sample. 


Table 10.9. Baseline H.263 YCbCr Parameters. 


466 = Chapter 10: H.261 and H.263 


Forced Updating 

This is achieved by forcing the use of the intra 
mode. To control the accumulation of inverse 
transform mismatch errors, a macro-block 
should be forcibly updated at least once every 
132 times it is transmitted. 


Video Bitstream 


Unless specified otherwise, the most signifi- 
cant bits are transmitted first. Bit 1, the left- 
most bit in the code tables, is the most 
significant. Unless specified otherwise, all 
unused or spare bits are set to “1.” 


352 SAMPLES 


GROUP OF BLOCKS 


The video multiplexer is arranged in a 
hierarchical structure with four layers. From 
top to bottom the layers are: 


Picture 

Group of Blocks (GOB) or Slice 
Macroblock (MB) 

Block 


Picture Layer 

Data for each picture consists of a picture 
header followed by data for a group of blocks 
(GOBs), followed by an_ end-of-sequence 
(EOS) and stuffing bits (PSTUF). The baseline 
structure is shown in Figure 10.10. Picture 
headers for dropped pictures are not transmit- 
ted. 


BLOCK ARRANGEMENT 
WITHIN A MACROBLOCK 


CB DCT 5 
CR DCT 4 
DCT O DCT 1 
Y 
DCT 2 DCT 3 


MACROBLOCKS 


Figure 10.9. H.263 Arrangement of Group of Blocks, Macroblocks, and Blocks. 
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Picture Start Code (PSC) 

PSC is a 22-bit word with a value of 0000 0000 
0000 0000 1 00000. It must be byte-aligned; 
therefore, 0-7 zero bits are added before the 
start code to ensure the first bit of the start 
code is the first, and most significant, bit of a 


byte. 


Temporal Reference (TR) 

TR is an 8-bit binary number representing 256 
possible values. It is generated by increment- 
ing its value in the previously transmitted pic- 
ture header by one and adding the number of 
non-transmitted 29.97 Hz pictures since the 
last transmitted one. The arithmetic is per- 
formed with only the eight LSBs. 


If a custom picture clock frequency (PCF) 
is indicated, Extended TR (ETR) and TR form 
a 10-bit number where TR stores the eight 
LSBs and ETR stores the two MSBs. The arith- 
metic in this case is performed with the ten 
LSBs. 

In the PB frame and improved PB frame 
mode, TR only addresses P frames. 


PICTURE LAYER PSC TR PTYPE 


PQUANT 


CPM | PSBI TRB 


DBQUANT PEI 


PSUPP. PEI GOB 


GOB -- | GOB EOS PSTUF 


GOB LAYER GBSC GN GSBI GFID GQUANT MB MB tse MB 
MACROBLOCK LAYER COD MCBPC MODB CBPB CBPY DQUANT 
MVD MVDo2 MVD3 MVD4 MVDB BLOCK 1 BLOCK 6 
BLOCK LAYER INTRADC TCOEF 


Figure 10.10. Baseline H.263 Video Bitstream Layer Structures (Without Optional PLUSPTYPE 


Related Fields in the Picture Layer). 
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Type Information (PTYPE) 
PTYPE contains 13 bits of information about 
the picture: 


Bit1 “1” 

Bit2 “0” 

Bit3 Split screen indicator 
“Q” = off, a (ied = on 

Bit4 Document camera indicator 
“Q” — off, il lie = on 

Bit5 Freeze picture release 


“Q” = off, 1? = on 


Bit 6-8 Source format 
“000” = reserved 


“001” = SQCIF 
“010” = QCIF 
“011” = CIF 
“100” = 4CIF 
“101” = 16CIF 


“110” = reserved 
“111” = extended PTYPE 


If bits 6-8 are not “111,” the following five 
bits are present in PTYPE: 


Bit9 Picture coding type 
“0” = intra, “1” = inter 

Bit 10 Optional unrestricted motion 
vector mode 
“Q” = off, a ed = on 

Bit 11 Optional syntax-based arithmetic 


coding mode 
“Q” = off, bal ee = on 


Bit12 Optional advanced prediction 
mode 


“Q” = off, fl ba = on 


Bit13 Optional PB frames mode 
“0” = normal picture 


“1” = PB frame 


If bit 9 is set to “O,” bit 13 must be set to a “0.” 
Bits 10-13 are optional modes that are negoti- 
ated between the encoder and decoder. 


Quantizer Information (PQUANT) 

PQUANT is a 5-bit binary number (value of 1- 
31) representing the quantizer to be used until 
updated by a subsequent GQUANT or 
DQUANT. 


Continuous Presence Multipoint (CPM) 

CPM is a 1-bit value that signals the use of the 
optional continuous presence multipoint and 
video multiplex mode; “0” = off, “1” = on. CPM 
immediately follows PQUANT if PLUSPTYPE 
is not present, and is immediately after PLUSP- 
TYPE if PLUSPTYPE is present. 


Picture Sub-Bitstream Indicator (PSBI) 
PSBI is an optional 2-bit binary number that is 
only present if the optional continuous presence 


multipoint and video multiplex mode is indi- 
cated by CPM. 
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Temporal Reference of B Frames in PB 
Frames (TRB 
TRB is present if PTYPE or PLUSTYPE indi- 
cate a PB frame or improved PB frame. TRB isa 
3-bit or 5-bit binary number of the [number + 
1] of nontransmitted pictures (at 29.97 Hz or 
the custom picture clock frequency indicated 
in CPCFC) since the last I or P frame or the P- 
part of a PB frame or improved PB frame and 
before the B-part of the PB frame or improved 
PB frame. The value of TRB is extended to 5 
bits when a custom picture clock frequency is 
in use. 

The maximum number of non-transmitted 
pictures is six for 29.97 Hz, or thirty when a 
custom picture clock frequency is used. 


Quantizer Information for B Frames in PB 
Frames (DBQUANT) 

DBQUANT is present if PTYPE or PLUSTYPE 
indicate a PB frame or improved PB frame. 
DBQUANT is a 2-bit codeword indicating the 
relationship between QUANT and BQUANT 
as shown in Table 10.10. The division is done 
using truncation. BQUANT has a range of 1- 
31. If the result is less than 1 or greater than 
31, BQUANT is clipped to 1 and 31, respec- 
tively. 


DBQUANT BQUANT 
00 (5 * QUANT) / 4 
01 (6 * QUANT) / 4 
10 (7 * QUANT) / 4 
11 (8 * QUANT) / 4 


Table 10.10. Baseline H.263 DBQUANT 
Codes and QUANT/BQUANT Relationship. 


Extra Insertion Information (PEI) 
PEI is a bit which when set to “1” signals the 
presence of the PSUPP data field. 


Supplemental Enhancement Information 
(PSUPP) 

If PEI is set to “1,” then 9 bits follow consisting 
of 8 bits of data (PSUPP) and another PEI bit 
to indicate if a further 9 bits follow, and so on. 


End of Sequence (EOS) 

EOS is a 22-bit word with a value of 0000 0000 
0000 0000 1 11111. EOS must be byte aligned 
by inserting 0-7 zero bits before the code so 
that the first bit of the EOS code is the first, 
and most significant, bit of a byte. 


Stuffing (PSTUF) 

PSTUF is a variable-length word of zero bits. 
The last bit of PSTUF must be the last, and 
least significant, bit of a byte. 


Group of Blocks (GOB) Layer 

As shown in Figure 10.9, each picture is 
divided into groups of blocks (GOBs). A GOB 
comprises 16 lines for the SQCIF, QCIF, and 
CIF resolutions, 32 lines for the 4CIF resolu- 
tion, and 64 lines for the 16CIF resolution. 
Thus, a SQCIF picture contains 6 GOBs (96/ 
16) each with one row of macroblock data. 
QCIF pictures have 9 GOBs (144/16) each 
with one row of macroblock data. A CIF pic- 
ture contains 18 GOBs (288/16) each with one 
row of macroblock data. 4CIF pictures have 18 
GOBs (576/32) each with two rows of macro- 
block data. A 16CIF picture has 18 GOBs 
(1152/64) each with four rows of macroblock 
data. GOB numbering starts with 0 at the top 
of picture, and increases going down vertically. 
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Data for each GOB consists of a GOB 
header followed by macroblock data, as shown 
in Figure 10.10. Macroblock data is transmit- 
ted in increasing macroblock number order. 
For GOB number 0 in each picture, no GOB 
header is transmitted. A decoder can signal an 
encoder to transmit only non-empty GOB 
headers. 


Group of Blocks Start Code (GBSC) 

GBSC is a 17-bit word with a value of 0000 0000 
0000 0000 1. It must be byte-aligned; therefore, 
0-7 zero bits are added before the start code to 
ensure the first bit of the start code is the first, 
and most significant, bit of a byte. 


Group Number (GN) 

GN is a 5-bit binary number indicating the 
number of the GOB. Group numbers 1-17 are 
used with the standard picture formats. Group 
numbers 1-24 are used with custom picture 
formats. Group numbers 16-29 are emulated 
in the slice header. Group number 30 is used in 
the end of sub-bitstream indicators (EOSBS) 
code and group number 31 is used in the end 
of sequence (EOS) code. 


GOB Sub-Bitstream Indicator (GSBI) 

GSBI is a 2-bit binary number representing the 
sub-bitstream number until the next picture or 
GOB start code. GSBI is present only if contin- 
uous presence multipoint and video multiplex 
(CPM) mode is enabled. 


GOB Frame ID (GFID) 

GFID is a 2-bit value indicating the frame ID. It 
must have the same value in every GOB (or 
slice) header of a given frame. In general, if 
PTYPE is the same as for the previous picture 
header, the GFID value must be the same as 
the previous frame. If PTYPE has changed 
from the previous picture header, GFID must 
have a different value from the previous frame. 


Quantizer Information (GQUANT) 

GQUANT is a 5-bit binary number that indi- 
cates the quantizer to be used in the group of 
blocks until overridden by any subsequent 
GQUANT or DQUANT. The codewords are 
the binary representations of the values 1-31. 


Macroblock (MB) Layer 
Each GOB is divided into macroblocks, as 
shown in Figure 10.9. A macroblock relates to 
16 samples x 16 lines of Y and the correspond- 
ing 8 samples x 8 lines of Cb and Cr. Macro- 
block numbering increases left-to-right and 
top-to-bottom. Macroblock data is transmitted 
in increasing macroblock numbering order. 
Data for a macroblock consists of a MB 
header followed by data for blocks (see Figure 
10.10). 


Coded Macroblock Indication (COD) 

COD is a single bit that indicates whether or 
not the block is coded. “0” indicates coded; “1” 
indicates not coded, and the rest of the mac- 
roblock layer is empty. COD is present only in 
pictures that are not intra. 

If not coded, the decoder processes the 
macroblock as an inter block with motion vec- 
tors equal to zero for the whole block and no 
coefficient data. 


Macroblock Type and Coded Block Pattern for 
Chrominance (MCBPC) 

MCBPC is a variable-length codeword indicat- 
ing the macroblock type and the coded block 
pattern for Cb and Cr. 

Codewords for MCBPC are listed in 
Tables 10.11 and 10.12. A codeword is available 
for bit stuffing, and should be discarded by 
decoders. In some cases, bit stuffing must not 
occur before the first macroblock of the pic- 
ture to avoid start code emulation. The mac- 
roblock types (MB Type) are listed in Tables 
10.13 and 10.14. 
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MB Type igi Code 
3 0,0 1 
3 0,1 001 
3 1,0 010 
3 1,1 011 
4 0, 0 0001 
4 0,1 0000 01 
4 1,0 0000 10 
4 1,1 0000 11 
stuffing 0000 0000 1 


Table 10.11. Baseline H.263 Variable-Length Code Table for MCBPC for | Frames. 


The coded block pattern for chrominance 
(CBPC) signifies when a non-intra DC trans- 
form coefficient is transmitted for Cb or Cr. A 
“1” indicates a non-intra DC coefficient is 
present that block. 


Macroblock Mode for B Blocks (MODB) 
MODDB is present for macroblock types 0-4 if 
PTYPE indicates PB frame. It is a variable- 
length codeword indicating whether B coeffi- 
cients and/or motion vectors are transmitted 
for this macroblock. Table 10.15 lists the code- 
words for MODB. MODB is coded differently 
for improved PB frames. 


Coded Block Pattern for B Blocks (CBPB) 
The 6-bit CBPB is present if indicated by 
MODB. It indicates which blocks in the mac- 
roblock have at least one transform coefficient 
transmitted. The pattern number is repre- 
sented as: 


PoP 1 PoP3P4P5 


where P,, = “1” for any coefficient present for 
block [n], else P, = “0.” Block numbering is 
given in Figure 10.9. 


Coded Block Pattern for Luminance (CBPY) 
CBPY is a variable-length codeword specifying 
the Y blocks in the macroblock for which at 
least one non-intra DC transform coefficient is 
transmitted. However, in the advanced intra 
coding mode, intra DC is indicated in the same 
manner as the other coefficients. 

Table 10.16 lists the codes for CBPY. Yy is 
a “1” if any non-intra DC coefficient is present 
for that Y block. Y block numbering is as 
shown in Figure 10.9. 


Quantizer Information (DQUANT) 

DQUANT is a 2-bit codeword signifying the 
change in QUANT. Table 10.17 lists the differ- 
ential values for the codewords. 

QUANT has a range of 1-31. If the value of 
QUANT as a result of the indicated change is 
less than 1 or greater than 31, it is made 1 and 
31, respectively. 
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MB Type ice ch Code 

0 0, 0 1 

0 0,1 0011 

0 1,0 0010 

0 1,1 0001 01 

1 0, 0 011 

1 0,1 0000 111 

1 1,0 0000 110 

1 1,1 0000 0010 1 

2 0, 0 010 

2 0,1 0000 101 

2 1,0 0000 100 

2 1,1 0000 0101 

3 0, 0 0001 1 

3 0,1 0000 0100 

3 1,0 0000 0011 

3 p Le | 0000 011 

4 0, 0 0001 00 

4 0,1 0000 0010 0 

4 1,0 0000 0001 1 

4 1,1 0000 0001 0 
stuffing 0000 0000 1 

5 0, 0 0000 0000 010 

5 0,1 0000 0000 0110 

5 1,0 0000 0000 0111 

5 1,1 0000 0000 0111 1 


Table 10.12. Baseline H.263 Variable-Length Code Table for MCBPC for P Frames. 
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pa me Name cop McBpc | cBPY | DQUANT | MVD MVD>_4 
ype Type 
inter noe - x 
coded 
inter 0 inter x x x 
inter 1 inter + q x x x x 
inter 2 inter4v x x x x 
inter 3 intra x x x 
inter 4 intra + q x x x 
inter 5 inter4v + q x x x x x 
inter stuffing - x x 
intra 3 intra x x 
intra 4 intra + q x x 
intra stuffing - x 


Table 10.13. Baseline H.263 Macroblock Types and Included Data for Normal Frames. 


pete fe Name cop McBpc | MODB | cCBPY 

inter Ries d - x 

inter 0 inter x x x x 
inter 1 inter + q x x x x 
inter 2 inter4v x x x x 
inter 3 intra x x x x 
inter 4 intra + q x x x x 
inter 5 inter4v + q x x x x 
inter stuffing - x x 


Table 10.14a. Baseline H.263 Macroblock Types and Included Data for PB Frames. 
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iy Me Name cBpB | DQUANT | Mvp | MvpB | MvD,, 
ype Type 

inter ese = 

inter (0) inter x x x 

inter 1 inter + q x x x x 

inter 2 inter4v x x x x 
inter 3 intra x x x 

inter 4 intra + q x x x 

inter 5 inter4v + q x x x x 
inter stuffing - 


Table 10.14b. Baseline H.263 Macroblock Types 


CBPB 


MVDB 


Table 10.15. Baseline H.263 Variable-Length Code Table for MODB. 


and Included Data for PB Frames. 
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CBPY 
(YO, Y1, Y2, Y3) Code 
Intra Inter 
0, 0, 0, O 1,1,1,1 0011 
0, 0, 0, 1 1,1,1,0 0010 1 
0,0, 1,0 1,1,0,1 0010 0 
0,0,1,1 1,1, 0,0 1001 
0,1, 0,0 1,0,1,1 0001 1 
0,1, 0,1 1,0,1,0 0111 
0,1,1,0 1,0, 0,1 0000 10 
0,1,1,1 1, 0, 0, O 1011 
1,0, 0,0 0,1,1,1 0001 0 
1,0, 0,1 0,1,1,0 0000 11 
1,0,1,0 0,1,0,1 0101 
1,0,1,1 0, 1, 0, 0 1010 
1,1,0,0 0,0, 1,1 0100 
1,1,0,1 0, 0, 1, 0 1000 
1,1,1,0 0, 0, 0, 1 0110 
1,1,1,1 0, 0, 0, O 11 


Table 10.16. Baseline H.263 Variable-Length Code Table for CBPY. 


Differential Value 


of QUANT Baten 
-1 00 
-2 OL 
1 10 
i 


Table 10.17. Baseline H.263 DQUANT Codes for QUANT Differential Values. 
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Motion Vector Data (MVD) 

Motion vector data is included for all inter mac- 
roblocks and intra blocks when in PB frame 
mode. 

Motion vector data consists of a variable- 
length codeword for the horizontal component, 
followed by a variable-length codeword for the 
vertical component. The variable-length codes 
are listed in Table 10.18. For the unrestricted 
motion vector mode, other motion vector cod- 
ing may be used. 


Motion Vector Data (MVD2-4) 

The three codewords MVDo, MVDs, and 
MVD, are present if indicated by PTYPE and 
MCBPC during the advanced prediction or 
deblocking filter modes. Each consists of a vari- 
able-length codeword for the horizontal com- 
ponent followed by a variable-length codeword 
for the vertical component. The variable-length 
codes are listed in Table 10.18. 


Motion Vector Data for B Macroblock 
MVDB 

MVDB is present if indicated by MODB dur- 
ing the PB frame and improved PB frame 
modes. It consists of a variable-length code- 
word for the horizontal component followed by 
a variable-length codeword for the vertical 
component of each vector. The variable-length 
codes are listed in Table 10.18. 


Block Layer 

If not in PB frames mode, a macroblock is 
made up of four Y blocks, a Cb block, and a Cr 
block (see Figure 10.9). Data for an 8 sample x 
8 line block consists of codewords for the intra 
DC coefficient and transform coefficients as 
shown in Figure 10.10. The order of block 
transmission is shown in Figure 10.9. 


In PB frames mode, a macroblock is made 
up of four Y blocks, a Cb block, a Cr block, and 
data for six B blocks. 

The quantized DCT coefficients are trans- 
mitted in the order shown in Figure 7.50. In 
the modified quantization mode, quantized 
DCT coefficients are transmitted in the order 
shown in Figure 7.51. 


DC Coefficient for Intra Blocks Untra DC) 
Intra DC is an 8-bit codeword. The values and 
their corresponding reconstruction levels are 
listed in Table 10.19. 

If not in PB frames mode, the intra DC 
coefficient is present for every block of the 
macroblock if MCBPC indicates macroblock 
type 3 or 4. In PB frames mode, the intra DC 
coefficient is present for every P block if 
MCBPC indicates macroblock type 3 or 4 (the 
intra DC coefficient is not present for B 
blocks). 


Transform Coefficient (TCOEF) 

If not in PB frames mode, TCOFF is present if 
indicated by MCBPC or CBPY. In PB frames 
mode, TCOFF is present for B blocks if indi- 
cated by CBPB. 

An event is a combination of a last non-zero 
coefficient indication (LAST = “0” if there are 
more non-zero coefficients in the block; LAST 
= “1” if there are no more non-zero coefficient 
in the block), the number of successive zeros 
preceding the coefficient (RUN), and the non- 
zero coefficient (LEVEL). 

The most common events are coded using 
a variable-length code, shown in Table 10.20. 
The “s” bit indicates the sign of the level; “0” 
for positive, and “1” for negative. 

Other combinations of (LAST; RUN, 
LEVEL) are encoded using a 22-bit word: 7 bits 
of escape (ESC), 1 bit of LAST, 6 bits of RUN, 
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pif code 

-16 16 0000 0000 0010 1 
-15.5 16.5 0000 0000 0011 1 
-15 17 0000 0000 0101 
-14.5 17.5 0000 0000 0111 
-14 18 0000 0000 1001 
-13.5 18.5 0000 0000 1011 
-13 19 0000 0000 1101 
-12.5 19.5 0000 0000 1111 
-12 20 0000 0001 001 
-11.5 20.5 0000 0001 011 
-11 21 0000 0001 101 
-10.5 21.5 0000 0001 111 
-10 22 0000 0010 001 
-9.5 22.5 0000 0010 011 

-9 23 0000 0010 101 
-8.5 23.5 0000 0010 111 

-8 24 0000 0011 001 
-7.5 24.5 0000 0011 011 

ee 25 0000 0011 101 
-6.5 25.5 0000 0011 111 

-6 26 0000 0100 001 
-5.5 26.5 0000 0100 011 

-5 27 0000 0100 11 
-4.5 27.5 0000 0101 01 

-4 28 0000 0101 11 
-3.5 28.5 0000 0111 

-3 29 0000 1001 

-2.5 29.5 0000 1011 

-2 30 0000 111 

-1.5 30.5 0001 1 

=i 31 0011 

-0.5 31.5 O11 

0 1 


Table 10.18a. Baseline H.263 Variable-Length Code Table for MVD, MVD 5_4, and MVDB. 
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wine ones eons 
0.5 -31.5 | 010 
1 -31 0010 
1.5 -30.5 | 0001 0 
2 -30 0000 110 
2.5 -29.5 | 0000 1010 
3 -29 0000 1000 
3.5 -28.5 | 0000 0110 
4 -28 0000 0101 10 
4.5 -27.5 | 0000 0101 00 
5 S57 0000 0100 10 
5.5 -26.5 | 0000 0100 010 
6 -26 0000 0100 000 
6.5 -25.5 | 0000 0011 110 
7 -25 0000 0011 100 
7.5 -24.5 | 0000 0011 010 
8 -24 0000 0011 000 
8.5 -23.5 | 0000 0010 110 
9 253 0000 0010 100 
9.5 -22.5 | 0000 0010 010 
10 -22 0000 0010 000 
10.5 -21.5 | 0000 0001 110 
11 291 0000 0001 100 
11.5 -20.5 | 0000 0001 010 
12 -20 0000 0001 000 
12.5 -19.5 | 0000 0000 1110 
13 -19 0000 0000 1100 
13.5 -18.5 | 0000 0000 1010 
14 -18 0000 0000 1000 
14.5 -17.5 | 0000 0000 0110 
15 217 0000 0000 0100 
15.5 -16.5 | 0000 0000 0011 0 


Table 10.18b. Baseline H.263 Variable-Length Code Table for MVD, MVD 5_4, and MVDB. 
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Intra DC 
Value 


Reconstruction 
Level 


0000 0000 


not used 


0000 0001 


8 


0000 0010 


16 


0000 0011 


24 


01111111 


1111 1111 


1000 0001 


1111 1101 


1111 1110 


Table 10.19. Baseline H.263 Reconstruction Levels for Intra DC. 


and 8 bits of LEVEL. The codes for RUN and 
LEVEL are shown in Table 10.21. Code 1000 
0000 is forbidden unless in the modified quanti- 
zation mode. 

All coefficients, except for intra DC, have 
reconstruction levels (REC) in the range —2048 
to 2047. Reconstruction levels are recovered 
by the following equations, and the results are 
clipped. 


if LEVEL = 0, REC = 0 


if QUANT = odd: 
|REC| = QUANT x (2 x |LEVEL| + 1) 


if QUANT = even: 
[REC] = QUANT x (2 x [LEVEL] + 1) - 1 


After calculation of |REC|, the sign is 
added to obtain REC. Sign(LEVEL) is specified 
by the “s” bit in the TCOEF code in Table 
10.20. 


REC = sign(LEVEL) x |REC| 


For intra DC blocks, the reconstruction 
level is: 


REC = 8x LEVEL 
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Last Run |Level] Code 
0 0 1 10s 
0 0 2 1111 s 
0 0 3 0101 Ols 
0 0 4 0010 111s 
0 0 5 0001 1111 s 
0 0 6 0001 0010 ls 
0 0 7 0001 0010 Os 
0 0 8 0000 1000 Ols 
0 0 9 0000 1000 00s 
0 0 10 0000 0000 111s 
0 0 11 0000 0000 110s 
0 0 12 0000 0100 000s 
0 1 1 110s 
0 1 2 0101 00s 
0 1 3 0001 1110 s 
0 1 4 0000 0011 11s 
0 1 5 0000 0100 001s 
0 1 6 0000 0101 0000 s 
0 2 1 1110 s 
0 2 2 0001 1101 s 
0 2 3 0000 0011 10s 
0 2 4 0000 0101 0001 s 
0 3 1 0110 ls 
0 3 2 0001 0001 1s 
0 3 3 0000 0011 Ols 
0 4 1 0110 Os 
0 4 2 0001 0001 Os 
0 4 3 0000 0101 0010 s 
0 5 1 0101 ls 
0 5 2 0000 0011 00s 
0 5 3 0000 0101 0011 s 
0 6 1 0100 11s 
0 6 2 0000 0010 11s 
0 6 3 0000 0101 0100 s 
0 7 1 0100 10s 


Table 10.20a. Baseline H.263 Variable-Length Code Table for TCOEF. 
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Last Run |Level] Code 
0 7 2 0000 0010 10s 
0 8 1 0100 Ols 
0 8 2 0000 0010 Ols 
0 9 1 0100 00s 
0 9 2 0000 0010 00s 
0 10 1 0010 110s 
0 10 2 0000 0101 0101 
0 11 1 0010 101s 
0 12 1 0010 100s 
0 13 1 0001 1100 s 
0 14 1 0001 1011 s 
0 15 1 0001 0000 ls 
0 16 1 0001 0000 Os 
0 17 1 0000 1111 ls 
0 18 1 0000 1111 Os 
0 19 1 0000 1110 1s 
0 20 1 0000 1110 0s 
0 21 1 0000 1101 1s 
0 22 1 0000 1101 Os 
0 23 1 0000 0100 010s 
0 24 1 0000 0100 011s 
0 25 1 0000 0101 0110 
0 26 1 0000 0101 0111 
1 0 1 0111 s 
1 0 2 0000 1100 1s 
1 0 3 0000 0000 101s 
1 1 1 0011 11s 
1 1 2 0000 0000 100s 
1 2 1 0011 10s 
1 3 1 0011 Ols 
1 4 1 0011 00s 
1 5 1 0010 011s 
1 6 1 0010 010s 
1 7 1 0010 001s 


Table 10.20b. Baseline H.263 Variable-Length Code Table for TCOEF. 
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Last Run |Level] Code 
8 1 0010 000s 
1 9 1 0001 1010 s 
1 10 1 0001 1001 s 
1 11 1 0001 1000 s 
1 12 1 0001 0111 s 
1 13 1 0001 0110 s 
1 14 1 0001 0101 s 
1 15 1 0001 0100 s 
1 16 1 0001 0011 s 
1 17 1 0000 1100 Os 
1 18 1 0000 1011 1s 
1 19 1 0000 1011 0s 
1 20 1 0000 1010 ls 
1 21 1 0000 1010 0s 
1 22 1 0000 1001 ls 
1 23 1 0000 1001 Os 
1 24 1 0000 1000 ls 
1 25 1 0000 0001 11s 
1 26 1 0000 0001 10s 
1 27 1 0000 0001 Ols 
1 28 1 0000 0001 00s 
1 29 1 0000 0100 100s 
1 30 1 0000 0100 101s 
1 31 1 0000 0100 110s 
1 32 1 0000 0100 111s 
1 33 1 0000 0101 1000 s 
1 34 1 0000 0101 1001 s 
1 35 1 0000 0101 1010 s 
1 36 1 0000 0101 1011 s 
1 37 1 0000 0101 1100 s 
1 38 1 0000 0101 1101 s 
1 39 1 0000 0101 1110 s 
1 40 1 0000 0101 1111 s 
ESC 0000 011 


Table 10.20c. Baseline H.263 Variable-Length Code Table for TCOEF. 
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Run Code 
0 0000 00 
1 0000 01 

63 1111 11 


Level Code 
-128 forbidden 
-127 1000 0001 
-2 1111 1110 
-l 1111 1111 
0 forbidden 
1 0000 0001 
2 0000 0010 
127 0111 1111 


Table 10.21. Baseline H.263 Run, Level Codes. 


PLUSPTYPE Picture Layer Option 
PLUSTYPE is present when indicated by bits 
6-8 of PTYPE, and is used to enable the H.263 
version 2 options. When present, the 
PLUSTYPE and related fields immediately fol- 
low PTYPE, preceding PQUANT. 

If PLUSPTYPE is present, then CPM 
immediately follows PLUSPTYPE. If PLUSP- 
TYPE is not present, then CPM immediately 
follows PQUANT. PSBI always immediately fol- 
lows CPM (if CPM = “1”). 

PLUSTYPE is a 12- or 30-bit codeword, 
comprised of up to three subfields: UFEP OPP- 
TYPE, and MPPTYPE. The PLUSTYPE and 
related fields are illustrated in Figure 10.11. 


Update Full Extended PTYPE (UFEP) 
UFEP is a 3-bit codeword present if “extended 
PTYPE” is indicated by PTYPE. 

A value of “000” indicates that only MPP- 
TYPE is included in the picture header. 

A value “001” indicates that both OPP- 
TYPE and MPPTYPE are included in the pic- 
ture header. If the picture type is intra or EI, 
this field must be set to “001.” 

In addition, if PLUSPTYPE is present in 
each of a continuing sequence of pictures, this 
field shall be set to “001” every five seconds or 
every five frames, whichever is larger. UFEP 
should be set to “001” more often in error- 
prone environments. 

Values other than “000” and “001” are 
reserved. 
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Optional Part of PLUSPTYPE (OPPTYPE) 


This field contains features that are not likely 
to be changed from one frame to another. If 
UFEP is “001,” the following bits are present in 


OPPTYPE: 


Bit 1-3 Source format 


Bit 4 


Bit 5 


Bit 6 


Bit 7 


Bit 8 


“000” = reserved 


“001” = SQCIF 
“010” = QCIF 
“011” = CIF 
“100” = 4CIF 
“101” = 16CIF 


“110” = custom source format 
“111” = reserved 


Custom picture clock frequency 
“0” = standard, “1” = custom 
Unrestricted motion vector (UMV) 
mode 

“Q” = off, ml ind = on 

Syntax-based arithmetic coding 
(SAC) mode 

“Q” = off, 1” = on 

Advanced prediction (AP) mode 
“Q” = off, bal licd = on 

Advanced intra coding 

(AIC) mode 

“Q” = off, sp? = on 


Bit 9 


Bit 10 


Bit 11 


Bit 12 


Bit 13 


Bit 14 


Bit 15 
Bit 16 
Bit 17 
Bit 18 


Deblocking filter (DF) mode 
“Q” = off, “y” = on 


Slice-structured (SS) mode 
mode 


“Q” = off, oy? = on 

Reference picture selection 
(RPS) mode 

“Q” = off, ae had = on 
Independent segment decoding 
(ISD) mode 

“Q” = off, Ba hed = on 

Alternative Inter VLC 

(AIV) mode 

“Q” = off, a ba = on 


Modified quantization (MQ) mode 
“Q” = off, “y” = on 


“1 
«oq? 
«oq? 
«Q” 


oe PLUSTYPE CPM PSBI CPFMT EPAR | CPCFC | ETR | UUI Sss 


ELNUM RLNUM 


RPSMF 


TRPI 


TRP | BCI | BCM 


RPRP 


Figure 10.11. H.263 PLUSPTYPE and Related Fields. 
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Mandatory Part of PLUSPTYPE (MPPTYPE) 
Regardless of the value of UFEP the following 
9 bits are also present in MPPTYPE: 


Bit 1-3 Picture code type 
“000” = I frame (intra) 
“001” = P frame (inter) 
“010” = Improved PB frame 
“011” = B frame 
“100” = EI frame 
“101” = EP frame 
“110” = reserved 
“111” = reserved 


Bit4 Reference picture resampling 
(RPR) mode 
“()? = off, ai bes = on 

Bit5 Reduced resolution update 
(RRU) mode 
“Q” = off, ba Be = on 

Bit6 Rounding type (RTYPE) mode 
“Q” = off, an hd = on 

Bit7 “0” 

Bit8 “0” 

Bit9 “1” 


Custom Picture Format (CPFMT) 

CPFMT is a 23-bit value that is present if the 
use of a custom picture format is specified by 
PLUSPTYPE and UFEP is “001.” 


Bit 1-4 _ Pixel aspect ratio code 
“0000” = reserved 
“0001” = 1:1 
“0010” = 12:11 
“0011” = 10:11 
“0100” = 16:11 
“0101” = 40:33 


“0110” — “1110” = reserved 
“1111” = extended PAR 


Bit 5-13 Picture width indication (PWI) 
number of samples per line = 
(PWI +1) x4 


Bit 14 “yy” 


Bit 15-23 Picture height indication (PHD 
number of lines per frame = 
(PHI +1) x4 


Extended Pixel Aspect Ratio (EPAR) 

EPAR is a 16-bit value present if CPFMT is 
present and “extended PAR” is indicated by 
CPFMT. 


Bit1-8 PAR width 
Bit 9-16 PAR height 


Custom Picture Clock Frequency Code 
(CPCFC) 

CPCFC is an 8-bit value present only if PLUSP- 
TYPE is present, UFEP is “001,” and PLUSP- 
TYPE indicates a custom picture clock 
frequency. The custom picture clock fre- 
quency (in Hz) is: 


1,800,000 / (clock divisor x clock conversion factor) 


Bit 1 Clock conversion factor code 
“Q” = 1000, “1” = 1001 
Bit 2-8 Clock divisor 


Extended Temporal Reference (ETR) 

ETR is a 2-bit value present if a custom picture 
clock frequency is in use. It is the two MSBs of 
the 10-bit TR value. 
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Unlimited Unrestricted Motion Vectors 
Indicator (UUI 

UUlis a 1- or 2-bit variable-length value indicat- 
ing the effective range limit of motion vectors. 
It is present if the optional unrestricted motion 
vector mode is indicated in PLUSPTYPE and 
UFEP is “001.” 

A value of “1” indicates the motion vector 
range is limited according to Tables 10.22 and 
10.23. A value of “01” indicates the motion vec- 
tor range is not limited except by the picture 
size. 


Horizontal Motion 


Picture Width Vector Range 


4-352 -32, +31.5 
356-704 -64, +63.5 
708-1408 -128, +127.5 
1412-2048 —256, +255.5 


Table 10.22. Optional Horizontal Motion 
Range. 


Vertical Motion 


Picture Height 


Vector Range 


4-288 -32, +31.5 
292-576 -64, +63.5 
580-1152 —128, +127.5 


Table 10.23. Optional Vertical Motion 
Range. 


Slice Structured Submode Bits (SSS) 

SSS is a 2-bit value present only if the optional 
slice structured mode is indicated in PLUSP- 
TYPE and UFEP is “001.” If the slice structured 
mode is in use but UFEP is not “001,” the last 
SSS value remains in effect. 


Bit1 Rectangular slices 
“<Q” =no, ad = yes 


Bit2 Arbitrary slice ordering 
“0” = sequential, “1” = arbitrary 


Enhancement Layer Number (ELNUM) 
ELNUM isa 4-bit value present only during the 
temporal, SNR, and spatial scalability mode. It 
identifies a specific enhancement layer. The 
first enhancement layer above the base layer is 
designated as enhancement layer number 2, 
and the base layer is number 1. 


Reference Layer Number (RLNUM) 

RLNUM is a 4-bit value present only during the 
temporal, SNR, and spatial scalability mode 
UFEP is “001.” The layer number for the 
frames used as reference anchors is identified 
by the RLNUM. 


Reference Picture Selection Mode Flags 
(RPSMF) 

RPSMF is a 3-bit codeword present only dur- 
ing the reference picture selection mode and 
UFEP is “001.” When present, it indicates 
which back-channel messages are needed by 
the encoder. If the reference picture selection 
mode is in use but RPSMF is not present, the 
last value of RPSMF that was sent remains in 
effect. 


“000” — “011” = reserved 

“100” = neither ACK nor NACK needed 
“101” = need ACK 

“110” = need NACK 

“111” = need both ACK and NACK 
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Temporal Reference for Prediction Indication 


Reference Picture Resampling Parameters 


TRPI 
TRPI is a 1-bit value present only during the 
reference picture selection mode. When present, 
it indicates the presence of the following TRP 
field. “O” = TRP field not present; “1” = TRP 
field present. TRPI is “0” whenever the picture 
header indicates an I frame or EI frame. 


Temporal Reference for Prediction (TRP) 

TRP is a 10-bit value indicating the temporal 
reference used for encoding prediction, except 
in the case of B frames. For B frames, the 
frame having the temporal reference specified 
by TRP is used for the prediction in the for- 
ward direction. 

If the custom picture clock frequency is 
not being used, the two MSBs of TRP are zero 
and the LSBs contain the 8-bit TR value in the 
picture header of the reference picture. If a 
custom picture clock frequency is being used, 
TRP is a 10-bit number consisting of the con- 
catenation of ETR and TR from the reference 
picture header. 

If TRP is not present, the previous anchor 
picture is used for prediction, as when not in 
the reference picture selection mode. TRP is 
valid until the next PSC, GSC, or SSC. 


Back-Channel message Indication (BCI) 

BCI is a 1- or 2-bit variable-length codeword 
present only during the optional reference pic- 
ture selection mode. “1” indicates the presence 
of the optional back-channel message (BCM) 
field. “01” indicates the absence or the end of 
the back-channel message field. BCM and BCI 
may be repeated when present. 


Back-Channel Message (BCM) 
The variable-length back-channel message is 
present if the preceding BCI field is set to “1.” 


(RPRP) 

A variable-length field present only during the 
optional reference picture resampling mode. 
This field carries the parameters of the refer- 
ence picture resampling mode. 


Baseline H.263 Optional Modes 


Unrestricted Motion Vector Mode 
In this baseline H.263 optional mode, motion 
vectors are allowed to point outside the pic- 
ture. The edge samples are used as prediction 
for the “non-existing” samples. The edge sam- 
ple is found by limiting the motion vector to 
the last full sample position within the picture 
area. Motion vector limiting is done separately 
for the horizontal and vertical components. 
Additionally, this mode includes an exten- 
sion of the motion vector range so that larger 
motion vectors can be used (Tables 10.22 and 
10.23). These longer motion vectors improve 
the coding efficiency for the larger picture for- 
mats, such 4CIF or 16CIE A significant gain is 
also achieved for the other picture formats if 
there is movement along the picture edges, 
camera movement, or background movement. 
When this mode is employed within H.263 
version 2, new reversible variable-length codes 
(RVLCs) are used for encoding the motion vec- 
tors, as shown in Table 10.24. These codes are 
single-valued, as opposed to the baseline dou- 
ble-valued VLCs. The double-valued codes 
were not popular due to limitations in their 
extensibility and their high cost of implementa- 
tion. The RVLCs are also easier to implement. 
Each row in Table 10.24 represents a 
motion vector difference in halfpixel units. 
“...X1Xq” denotes all bits following the leading 
“1” in the binary representation of the absolute 
value of the motion vector difference. The “s” 
bit denotes the sign of the motion vector differ- 
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Absolute Value of Motion Vector Code 
Difference in Half-Pixel Units 

0 1 
1 0s0 
“xo” + 2 (2-3) Oxo1s0 
“x1X0” + 4 (4-7) Oxy 1x1s0 
“xoX1X0” + 8 (8-15) 0x91x11xp1s0 
“xgX0X1X9” + 16 (16-31) 0x3 1xX91x;1x91s0 
“x4X3X0X1X9” + 32 (32-63) 0x41x31x91x;1x91s0 
“x5X4X3X9X1X0” + 64 (64-127) 0x51x41x31x91x;1x91s0 
“X@X5X4X3X9X1X0” + 128 (128-255) 0xg1x51x41x31x91x11x91s0 
“x7XgX5X4X3X0X1X0” + 256 (256-511) 0x71xg1x51x41x31x91x,1x91s0 
“xgX7X6X5X4X3X2X1X9” + 512 (512-1023) 0xg1x71xg1x51x41x31x91x,1xo1s0 
“xoXgX7X6X5X4X3X9X1XQ” + 1024 (1024-2047) 0X91xXg1x71xg1x51x41x31x91x,1x91s0 
“x1 0X9XgX7X6X5X4X3X2X1X0” + 2048 (2048-4095) Oxy 91X91xg1x71xg1x51x41x3 1x91x,1x91s0 


Table 10.24. H.263 Reversible Variable-Length Codes for Motion Vectors. 


ence: “0” for positive and “1” for negative. The 
binary representation of the motion vector dif- 
ference is interleaved with bits that indicate if 
the code continues or ends. The “0” in the last 
position indicates the end of the code. 

RVLCs can also be used to increase resil- 
ience to channel errors. Decoding can be per- 
formed by processing the motion vectors in 
the forward and reverse directions. If an error 
is detected while decoding in one direction, 
the decoder can proceed in the reverse direc- 
tion, improving the error resilience of the bit 
stream. In addition, the motion vector range is 
extended up to [-256, +255.5,] depending on 
the picture size. 


Syntax-based Arithmetic Coding Mode 

In this baseline H.263 optional mode, the vari- 
able-length coding is replaced with arithmetic 
coding. The SNR and reconstructed pictures 
will be the same, but the bit rate can be 
reduced by about 5% since the requirement of 
a fixed number of bits for information is 
removed. 

The syntax of the picture, group of blocks, 
and macroblock layers remains exactly the 
same. The syntax of the block layer changes 
slightly in that any number of TCOEF entries 
may be present. 

It is worth noting that use of this mode is 
not widespread. 
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Advanced Prediction Mode 

In this baseline H.263 optional mode, four 
motion vectors per macroblock (one for each Y 
block) are used instead of one. In addition, 
overlapped block motion compensation (OBMC) 
is used for the Y blocks of P frames. 

If one motion vector is used for a macro- 
block, it is defined as four motion vectors with 
the same value. If four motion vectors are used 
for a macroblock, the first motion vector is the 
MVD codeword and applies to Y, in Figure 
10.9. The second motion vector is the MVD2 
codeword that applies to Ys, the third motion 
vector is the MVD3 codeword that applies to 
Y3, and the fourth motion vector is the MVD, 
codeword that applies to Y4. The motion vector 
for Cb and Cr of the macroblock is derived 
from the four Y motion vectors. 

The encoder has to decide which type of 
vectors to use. Four motion vectors use more 
bits, but provide improved prediction. This 
mode improves inter-picture prediction and 
yields a significant improvement in picture 
quality for the same bit rate by reducing block- 
ing artifacts. 


PB Frames Mode 

Like MPEG, baseline H.263 optionally sup- 
ports PB frames. A PB frame consists of one P 
frame (predicted from the previous P frame) 
and one B frame (bi-directionally predicted 
from the previous and current P frame), as 
shown in Figure 10.12. 

With this coding option, the picture rate 
can be increased without substantially increas- 
ing the bit rate. However, an improved PB 
frames mode is supported in H.263 version 2. 
This original PB frames mode is retained only 
for purposes of compatibility with systems 
made prior to the adoption of H.263 version 2. 


H.263 Version 2 Optional Modes 


Continuous Presence Multipoint and 
Video Multiplex Mode 

In this H.263 version 2 optional mode, up to 
four independent H.263 bitstreams can be mul- 
tiplexed into a single bitstream. The sub-bit- 
stream with the lowest identifier number (sent 
via the SBI field) is considered to have the 
highest priority unless a different priority con- 
vention is established by external means. 

This feature is designed for use in continu- 
ous presence multipoint application or other 
situations in which separate logical channels 
are not available, but the use of multiple video 
bitstreams is desired. It is not to be used with 
H.324. 


Forward Error Correction Mode 
This H.263 version 2 optional mode provides 
forward error correction (code and framing) 
for transmission of H.263 video data. It is not to 
be used with H.324. 

Both the framing and the forward error 
correction code are the same as in H.261. 


Advanced Intra Coding Mode 

This H.263 version 2 optional mode improves 
compression for intra macroblocks. It uses 
intra-block prediction from neighboring intra 
blocks, a modified inverse quantization of intra 
DCT coefficients, and a separate VLC table for 
intra coefficients. This mode _ significantly 
improves the compression performance over 
the intra coding of baseline H.263. 

An additional 1- or 2-bit variable-length 
codeword, INTRA_MODE, is added to the 
macroblock layer immediately following the 
MCBPC field to indicate the prediction mode: 
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PB FRAME 


BI-DIRECTIONAL 
PREDICTION 


We Tn 


PREDICTION 


BI-DIRECTIONAL (B) FRAME 


[ PREDICTED (P) FRAME 


Figure 10.12. Baseline H.263 PB Frames. 


“0” = DC only 
“10” = Vertical DC and AC 
“11” = Horizontal DC and AC 


For intra-coded blocks, if the prediction 
mode is DC only, the zig-zag scan order in Fig- 
ure 7.50 is used. If the prediction mode is verti- 
cal DC and AC, the zig-zag scan order in 
Figure 7.51 is used. If the prediction mode is 
horizontal DC and AC, the zig-zag scan order 
in Figure 7.52 is used. 

For non-intra blocks, the zig-zag scan 
order in Figure 7.50 is used. 


Deblocking Filter Mode 

This H.263 version 2 optional mode introduces 
a deblocking filter inside the coding loop. The 
filter is applied to the edge boundaries of 8 x 8 
blocks to reduce blocking artifacts. 

The filter coefficients depend on the mac- 
roblock’s quantizer step size, with larger coef 
ficients used for a coarser quantizer. This 
mode also allows the use of four motion vec- 
tors per macroblock, as specified in the 
advanced prediction mode, and also allows 
motion vectors to point outside the picture, as 
in the unrestricted motion vector mode. The 


computationally expensive overlapping motion 
compensation operation of the advanced pre- 
diction mode is not used so as to keep the com- 
plexity of this mode minimal. 

The result is better prediction and a reduc- 
tion in blocking artifacts. 


Slice Structured Mode 
In this H.263 version 2 optional mode, a slice 
layer is substituted for the GOB layer. This 
mode provides error resilience, makes the bit- 
stream easier to use with a packet transport 
delivery scheme, and minimizes video delay. 
The slice layer consists of a slice header 
followed by consecutive complete macro- 
blocks. Two additional modes can be signaled 
to reflect the order of transmission (sequential 
or arbitrary) and the shape of the slices (rect- 
angular or not). These add flexibility to the 
slice structure so that it can be designed for 
different applications. 


Supplemental Enhancement Information 
With this H.263 version 2 optional mode, addi- 
tional supplemental information may be 
included in the bitstream to signal enhanced 
display capability. 


H.263 491 


Typical enhancement information can sig- 
nal full- or partial-picture freezes, picture 
freeze releases, or chroma keying for video 
compositing. 

The supplemental information may be 
present in the bitstream even though the 
decoder may not be capable of using it. The 
decoder simply discards the supplemental 
information, unless a requirement to support 
the capability has been negotiated by external 
means. 


Improved PB Frames Mode 

This H.263 version 2 optional mode represents 
an improvement compared to the baseline 
H.263 PB frames option. This mode permits 
forward, backward, and bi-directional predic- 
tion for B frames in a PB frame. The operation 
of the MODB field changes are shown in Table 
10.25 

Bi-directional prediction methods are the 
same in both PB frames modes except that, in 
the improved PB frames mode, no delta vector 
is transmitted. 

In for ward prediction, the B macroblock is 
predicted from the previous P macroblock, and 
a separate motion vector is then transmitted. 

In backwards prediction, the predicted 
macroblock is equal to the future P macro- 
block, and therefore no motion vector is trans- 
mitted. 

Improved PB frames are less susceptible 
to changes that may occur between frames, 
such as when there is a scene cut between the 
previous P frame and the PB frame. 


Reference Picture Selection Mode 

In baseline H.263, a frame may be predicted 
from the previous frame. If a portion of the ref- 
erence frame is lost due to errors or packet 
loss, the quality of future frames is degraded. 


Using this H.263 version 2 optional mode, it is 
possible to select which reference frame to use 
for prediction, minimizing error propagation. 

Four back-channel messaging signals 
(NEITHER, ACK, NACK and ACK+NACK) are 
used by the encoder and decoder to specify 
which picture segment will be used for predic- 
tion. For example, a NACK sent to the encoder 
from the decoder indicates that a given frame 
has been degraded by errors. Thus, the 
encoder may choose not to use this frame for 
future prediction, and instead use a different, 
unaffected, reference frame. This reduces 
error propagation, maintaining improved pic- 
ture quality in error-prone environments. 


Temporal, SNR and Spatial Scalability 
Mode 

In this H.263 version 2 optional mode, there is 
support for temporal, SNR, and spatial scalabil- 
ity. Scalability allows for the decoding of a 
sequence at more than one quality level. This 
is done by using a hierarchy of pictures and 
enhancement pictures partitioned into one or 
more layers. The lowest layer is called the base 
layer. 

The base layer is a separately decodable 
bitstream. The enhancement layers can be 
decoded in conjunction with the base layer to 
increase the picture rate, increase the picture 
quality, or increase the picture size. 

Temporal scalability is achieved using bi- 
directionally predicted pictures, or B frames. 
They allow prediction from either or both a 
previous and subsequent picture in the base 
layer. This results in improved compression as 
compared to that of P frames. These B frames 
differ from the B-picture part of a PB frame or 
improved PB frame in that they are separate 
entities in the bitstream. 
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CBPB MVDB Code Coding Mode 
0 bi-directional prediction 
x 10 bi-directional prediction 
110 forward prediction 
x x 1110 forward prediction 
11110 backward prediction 
x 111111 backward prediction 


Table 10.25. H.263 Variable-Length Code Table for MODB for Improved PB Frames Mode. 


SNR scalability refers to enhancement 
information that increases the picture quality 
without increasing resolution. Since compres- 
sion introduces artifacts, the difference 
between a decoded picture and the original is 
the coding error. Normally, the coding error is 
lost at the encoder and never recovered. With 
SNR scalability, the coding errors are sent to 
the decoder, enabling an enhancement to the 
decoded picture. The extra data serves to 
increase the signal-to-noise ratio (SNR) of the 
picture, hence the term “SNR scalability.” 

Spatial scalability is closely related to SNR 
scalability. The only difference is that before 
the picture in the reference layer is used to 
predict the picture in the spatial enhancement 
layer, it is interpolated by a factor of two either 
horizontally or vertically (1D spatial scalabil- 
ity), or both horizontally and vertically (2D 
spatial scalability). Other than the upsampling 
process, the processing and syntax for a spatial 
scalability picture is the same as for a SNR 
scalability picture. 


Since there is very little syntactical distinc- 
tion between frames using SNR scalability and 
frames using spatial scalability, the frames 
used for either purpose are called EI frames 
and EP frames. 

The frame in the base layer which is used 
for upward prediction in an EI or EP frame 
may be an I frame, a P frame, the P-part of a PB 
frame, or the P-part of an improved PB frame 
(but not a B frame, the B-part of a PB frame, or 
the B-part an improved PB frame). 

This mode can be useful for networks hav- 
ing varying bandwidth capacity. 


Reference Picture Resampling Mode 

In this H.263 version 2 optional mode, the ref- 
erence frame is resampled to a different size 
prior to using it for prediction. 

This allows having a different source refer- 
ence format than the frame being predicted. It 
can also be used for global motion estimation, 
or estimation of rotating motion, by warping 
the shape, size and location of the reference 
frame. 
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Reduced Resolution Update Mode 

An H.263 version 2 optional mode is provided 
which allows the encoder to send update infor- 
mation for a frame encoded at a lower resolu- 
tion, while still maintaining a higher resolution 
for the reference frame, to create a final frame 
at the higher resolution. 

This mode is best used when encoding a 
highly active scene, allowing an encoder to 
increase the frame rate for moving parts of a 
scene, while maintaining a higher resolution in 
more static areas of the scene. 

The syntax is the same as baseline H.263, 
but interpretation of the semantics is different. 
The dimensions of the macroblocks are dou- 
bled, so the macroblock data size is one-quar- 
ter of what it would have been without this 
mode enabled. Therefore, motion vectors must 
be doubled in both dimensions. To produce 
the final picture, the macroblock is upsampled 
to the intended resolution. After upsampling, 
the full resolution frame is added to the 
motion-compensated frame to create the full 
resolution frame for future reference. 


Independent Segment Decoding Mode 

In this H.263 version 2 optional mode, picture 
segment boundaries are treated as picture 
boundaries—no data dependencies across seg- 
ment boundaries are allowed. 

Use of this mode prevents the propagation 
of errors, providing error resilience and recov- 
ery. This mode is best used with slice layers, 
where, for example, the slices can be sized to 
match a specific packet size. 


Alternative Inter VLC Mode 

The intra VLC table used in the advanced intra 
coding mode can also be used for inter block 
coding when this H.263 version 2 optional 
mode is enabled. 


Large quantized coefficients and small 
runs of zeros, typically present in intra blocks, 
become more frequent in inter blocks when 
small quantizer step sizes are used. When bit 
savings are obtained, and the use of the intra 
quantized DCT coefficient table can be 
detected at the decoder, the encoder will use 
the intra table. The decoder will first try to 
decode the quantized coefficients using the 
inter table. If this results in addressing coeffi- 
cients beyond the 64 coefficients of the 8 x 8 
block, the decoder will use the intra table. 


Modified Quantization Mode 

This H.263 version 2 optional mode improves 
the bit rate control for encoding, reduces CbCr 
quantization error, expands the range of DCT 
coefficients, and places certain restrictions on 
coefficient values. 

In baseline H.263, the quantizer value may 
be modified at the macroblock level. However, 
only a small adjustment (+1 or +2) in the value 
of the most recent quantizer is permitted. The 
modified quantization mode allows the modifi- 
cation of the quantizer to any value. 

In baseline H.263, the Y and CbCr quantiz- 
ers are the same. The modified quantization 
mode also increases CbCr picture quality by 
using a smaller quantizer step size for the Cb 
and Cr blocks relative to the Y blocks. 

In baseline H.263, when a quantizer 
smaller than 8 is employed, quantized coeffi- 
cients exceeding the range of [-127, +127] are 
clipped. The modified quantization mode also 
allows coefficients that are outside the range of 
[-127, +127] to be represented. Therefore, 
when a very fine quantizer step size is 
selected, an increase in Y quality is obtained. 
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H.263 Version 2 Levels 


To enable systems to have a higher probability 
of connecting using other than baseline H.263, 
several preferred modes of operation are 
defined. Each level includes support for the 
levels below it. 


Level 3 Preferred Modes 
e Advanced Prediction 
¢ Improved PB Frames 
¢ Independent Segment Decoding 


e Alternative Inter VLC 


Level 2 Preferred Modes 
e Unrestricted Motion Vector 
e Slice Structured 


e Reference Picture Resampling 
(implicit factor-of-4 mode only) 


Level 1 Preferred Modes 
e Advanced Intra Coding 
¢ Deblocking Filter 


¢ Supplemental Enhancement Information 
(full-frame freeze only) 


¢ Modified Quantization 


H.263++ 


Under development at the time of printing, 
H.263++ will add three additional options to 
H.263 version 2. 

An optional Enhanced Reference Picture 
Selection (ERPS) mode will offer enhanced 
coding efficiency and error resilience. It man- 
ages a multi-picture buffer of stored pictures. 

An optional Data Partitioned Slice (DPS) 
mode will offer enhanced error resilience. It 
separates the header and motion vector data 


from the DCT coefficient data and protects the 
motion vector data by using a reversible repre- 
sentation. 

An optional Additional Supplemental 
Enhancement Information which can be added 
to a H.263 bitstream to provide backward-com- 
patible enhancements, such as: 


(a) Indication of using a specific fixed- 
point IDCT 


(b) Picture Messages, including message 
types: 
Arbitrary binary data 


e Text (arbitrary, copyright, caption, video de- 
scription, or Uniform Resource Identifier) 


Picture header repetition (current, previous, 
next with reliable temporal reference, or next 
with unreliable temporal reference) 


Interlaced field indications (top or bottom) 


e Spare reference picture identification 
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Consumer DV 


The consumer DV (digital video) format is 
used by consumer digital camcorders, and is 
based on IEC 61834 (25 Mbps data rate) and 
the newer SMPTE 314M specification (25 or 
50 Mbps data rate). The compression algo- 
rithm used is neither motion-JPEG nor MPEG, 
although it shares much in common with 
MPEG I frames. A proprietary compression 
algorithm is used that can be edited since it is 
an intra-frame technique. 

The digitized video is stored in memory 
before compression is done. The correlation 
between the two fields stored in the buffer is 
measured. If the correlation is low, indicating 
inter-field motion, the two fields are individu- 
ally compressed. Normally, the entire frame is 
compressed. In either case, DCT-based com- 
pression is used. 

Video data is compressed about 5:1, 
depending on the amount of motion within a 
frame. To achieve a constant 25 or 50 Mbps bit 
rate, DV uses adaptive quantization, which 
uses the appropriate DCT quantization table 
for each frame. 


Figure 11.1 illustrates the contents of one 
track as written on tape. The JTI sector (insert 
and track information) contains information on 
track status and serves in place as a conven- 
tional control track during video editing. 

The audio sector, shown in Figure 11.2, 
contains both audio data and auxiliary audio 
data (AAUX). 

The video sector, shown in Figure 11.3, con- 
tains video data and auxiliary video data 
(VAUX). VAUX data includes recording date 
and time, lens aperture, shutter speed, color 
balance, and other camera settings. 

The subcode sector stores a variety of infor- 
mation, including timecode, teletext, closed 
captioning in multiple languages, subtitles and 
karaoke lyrics in multiple languages, titles, 
table of contents, chapters, etc. The subcode 
sector, AAUX data, and VAUX data use 5-byte 
blocks of data called packs. 

Chapter 6 reviews the transmission of DV 
over IEEE 1394 (IEC 61883), which is a com- 
mon interface for digital camcorders. This 
chapter reviews the transmission of DV over 
SDTI (SMPTE 221M and 222M), which is 
aimed at the pro-video environment. 
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TI EDIT AUDIO AUDIO AUDIO EDIT 
SECTOR GAP PREAMBLE SECTOR POSTAMBLE GAP 
625 BITS 500 BITS 14 SYNC BLOCKS 550 BITS 700 BITS 
VIDEO VIDEO VIDEO EDIT 
PREAMBLE SECTOR POSTAMBLE GAP 
500 BITS 149 SYNC BLOCKS 975 BITS 1550 BITS 
SUBCODE 
SUBCODE SUBCODE OVERWRITE 
POSTAMBLE 
PREAMBLE SECTOR MARGIN 
1325 BITS 
1200 BITS 12 SYNC BLOCKS 1250 BITS 
(1200 BITS) 


Figure 11.1. Sector Arrangement for One Track for a 480-Line System. The total bits per track, 
excluding the overwrite margin, is 134,975 (134,850). There are 10 (12) of these tracks per 
video frame. 576-line system parameters (if different) are shown in parentheses. 


SYNC 
BLOCK 
NUMBER 
INNER 
SYNC ID AAUX DATA AUDIO DATA 
0) PARITY 
2 BYTES | 3 BYTES 5 BYTES 72 BYTES 
8 BYTES 
INNER 
SYNC ID AAUX DATA AUDIO DATA 
1 PARITY 
2 BYTES | 3 BYTES 5 BYTES 72 BYTES 
8 BYTES 
INNER 
SYNC ID AAUX DATA AUDIO DATA 
8 PARITY 
2 BYTES | 3 BYTES 5 BYTES 72 BYTES 
8 BYTES 
INNER 
SYNC ID 
9 OUTER PARITY PARITY 
2 BYTES | 3 BYTES 
8 BYTES 
INNER 
13 SNe i OUTER PARITY PARITY 
2 BYTES | 3 BYTES 
8 BYTES 


Figure 11.2. Structure of Sync Blocks in an Audio Sector. 
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SYNC ID VAUX DATA Nee 
0) PARITY 
2 BYTES | 3 BYTES 77 BYTES 
8 BYTES 
INNER 
SYNC ID VAUX DATA 
1 PARITY 
2 BYTES | 3 BYTES 77 BYTES 
8 BYTES 
INNER 
SYNC ID VIDEO DATA 
2 PARITY 
2 BYTES | 3 BYTES 77 BYTES 
8 BYTES 
INNER 
SYNC ID VIDEO DATA 
136 PARITY 
2 BYTES | 3 BYTES 77 BYTES 
8 BYTES 
INNER 
SYNC ID VAUX DATA 
137 PARITY 
2 BYTES | 3 BYTES 77 BYTES 
8 BYTES 
INNER 
SYNC ID 
138 OUTER PARITY PARITY 
2 BYTES | 3 BYTES 
8 BYTES 
INNER 
SYNC ID 
148 OUTER PARITY PARITY 
2 BYTES | 3 BYTES 
8 BYTES 


Figure 11.3. Structure of Sync Blocks in a Video Sector. 


Audio 


An audio frame starts with an audio sample 
within -50 samples of the beginning of line 1 
(480-line systems) or the middle of line 623 
(576-line systems). 

Each track contains 9 audio sync blocks, 
with each audio sync block containing 5 bytes 
of audio auxiliary data (AAUX) and 72 bytes of 
audio data, as illustrated in Figure 11.2. Audio 
samples are shuffled over tracks and data-sync 
blocks within a frame. The remaining 5 audio 
sync blocks are used for error correction. 

Two 44.1-kHz, 16-bit channels require a 
data rate of about 1.64 Mbps. Four 32-kHz, 12- 


bit channels require a data rate of about 1.536 
Mbps. Two 48-kHz, 16-bit channels require a 
data rate of about 1.536 Mbps. 

IEC 61834 


IEC 61834 supports a variety of audio sampling 
rates: 


48 kHz (16 bits, 2 channels) 
44.1 kHz (16 bits, 2 channels) 
32 kHz (16 bits, 2 channels) 
32 kHz (12 bits, 4 channels) 
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Audio sampling may be either locked or 
unlocked to the video frame frequency. 

Audio data is processed in frames. At a 
locked 48 kHz sample rate, each frame con- 
tains either 1600 or 1602 audio samples (480- 
line system) or 1920 audio samples (576-line 
system). For the 480-line system, the number 
of audio samples per frame follows a five-frame 
sequence: 


1600, 1602, 1602, 1602, 1602, 1600 ... 


With a locked 32 kHz sample rate, each 
frame contains either 1066 or 1068 audio sam- 
ples (480-line system) or 1280 audio samples 
(576-line system). For the 480-line system, the 
number of audio samples per frame follows a 
fifteen-frame sequence: 


1066, 1068, 1068, 1068, 1068, 1068, 1068, 
1066, 1068, 1068, 1068, 1068, 1068, 1068, 
1068, ... 


For unlocked audio sampling, there is no 
exact number of audio samples per frame, 
although minimum and maximum values are 
specified. 


SMPTE 314M 


SMPTE 314M supports a more limited option, 
with audio sampling locked to the video frame 
frequency: 


48 kHz (16 bits, 2 channels) for 25 Mbps 
48 kHz (16 bits, 4 channels) for 50 Mbps 


Audio data is processed in frames. At a 
locked 48 kHz sample rate, each frame con- 
tains either 1600 or 1602 audio samples (480- 
line system) or 1920 audio samples (576-line 


system). For the 480-line system, the number 
of audio samples per frame follows a five-frame 
sequence: 


1600, 1602, 1602, 1602, 1602, 1600 ... 


The audio capacity is capable of 1620 sam- 
ples per frame for the 480-line system or 1944 
samples per frame for the 576-line system. The 
unused space at the end of each frame is filled 
with arbitrary data. 


Audio Auxiliary Data (AAUX) 


AAUX information is added to the shuffled 
audio data as shown in Figure 11.2. The AAUX 
pack includes a 1-byte pack header and 4 bytes 
of data (payload), resulting in a 5-byte AAUX 
pack. Since there are nine of them per video 
frame, they are numbered from 0 to 8. An 
AAUX source (AS) pack and an AAUX source 
control (ASC) pack must be included in the 
compressed stream. Only the AS and ASC 
packs are currently supported by SMPTE 
314M, although IEC 61834 supports many 
other pack formats. 


AAUX Source (AS) Pack 
The format for this pack is shown in Table 
11.1. 


LF Locked audio sample rate 
“Q” = locked to video 
“1” = unlocked to video 


AF Audio frame size. Specifies the 
number of audio samples per 
frame. 
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SM 


PA 


IEC 61834 D7 D6 D5 D4 D3 D2 D1 DO 
PCO 0 1 0 1 0 0 0 0 
PC1 LF 1 AF 
PC2 SM CHN PA AM 
PC3 1 ML 50/60 ST 
PC4 EF TC SMP QU 

SMPTE 314M D7 D6 D5 D4 D3 D2 D1 DO 
PCO 0 1 0 1 0 0 0 0 
PC1 LF 1 AF 
PC2 0 CHN 1 AM 
PC3 1 1 50/60 ST 
PC4 1 1 SMP QU 


Table 11.1. AAUX Source (AS) Pack. 


Stereo mode CHN 
“0” = multi-stereo audio 
“1” = lumped audio 


Specifies if the audio signals 
recorded in CH1 (CH3) are 
related to the audio signals 
recorded in CH2 (CH4) AM 
“0” = one of pair channels 
“1” = independent channels 
ML 


Number of audio channels within 
an audio block 

“00” = one channel per block 
“01” = two channels per block 
“10” = reserved 

“11” = reserved 


Specifies the content of the audio 
signal on each channel 


Multi-language flag 

“0” = recorded in multi-language 

“1” = not recorded in multi- 
language 
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50/60 


ST 


EF 


TC 


SMP 


QU 


50- or 59.94-Hz video system 
“0” = 59.94-Hz field system 
“1” = 50-Hz field system 


For SMPTE 314M, this specifies 
the number of audio blocks per 
frame. 

“00000” = 2 audio blocks 

“00001” = reserved 

“00010” = 4 audio blocks 

“00011” to “11111” = reserved 


For IEC 61834, this specifies the 
video system. 

“00000” = standard definition 
“00010” = reserved 

“00010” = high definition 
“00011” to “11111” = reserved 


Audio emphasis flag 
“Q” = on 
“ 1 ” = off 


Emphasis time constant 
“1” = 50/15 ps 
“0” = reserved 


Audio sampling frequency 


“000” = 48 kHz 
“001” = 44.1 kHz 
“010” = 32 kHz 


“011” to “111” = reserved 


Audio quantization 

“000” = 16 bits linear 
“001” = 12 bits nonlinear 
“010” = 20 bits linear 
“011” to “111” = reserved 


AAUX Source Control (ASC) Pack 


The format for this pack is shown in Table 


11.2. 


CGMS 


ISR 


CMP 


SS 


Copy generation management 


system 
“00” = copying permitted without 
restriction 


“01” = reserved 
“10” = one copy permitted 
“11” = no copy permitted 


Previous input source 
“00” = analog input 
“01” = digital input 
“10” = reserved 

“11” = no information 


Number of times of compression 
“00” = once 

“01” = twice 

“10” = three or more 

“11” = no information 


Source and recorded situation 

“00” = scrambled source with 
audience restrictions and 
recorded without 
descrambling 

“01” = scrambled source without 
audience restrictions and 
recorded without 
descrambling 

“10” = source with audience 
restrictions or descrambled 
source with audience 
restrictions 

“11” = no information 


Audio 501 
IEC 61834 D7 D6 D5 D4 D3 D2 D1 Do 
PCO 0 1 0 1 0 0 0 1 
PCl CGMS ISR CMP Ss 
PC2 RECS | RECE REC M ICH 
PC3 DRF SPD 
PC4 1 GEN 
SMPTE 314M D7 D6 D5 D4 D3 D2 D1 Do 
PCO 0 1 0 1 0 0 0 1 
PC1 CGMS 1 1 1 1 EFC 
PC2 RECS | RECE | FADES | FADEE 1 1 1 1 
PC3 DRF SPD 
PC4 1 1 1 1 1 1 1 1 
Table 11.2. AAUX Source Control (ASC) Pack. 
EFC Audio emphasis flags REC M Recording mode 
“00” = emphasis off “001” = original 
“01” = emphasis on “011” = one CH insert 
“10” = reserved “100” = four CHs insert 
“11” = reserved “101” = two CHs insert 
“111” = invalid recording 
RECS _ Recording start point 
“0” = at recording start point FADE S Fading of recording start point 
“1” = not at recording start point “0” = fading off 
“1” = fading on 
RECE Recording end point 


“0” = at recording end point 
“1” = not at recording end point 


FADE E 


Fading of recording end point 
“0” = fading off 
“1” = fading on 
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ICH Insert audio channel 
“000” = CH1 
“001” = CH2 
“010” = CH3 
“011” = CH4 
“100” = CH1, CH2 
“101” = CH3, CH4 
“110” = CH1, CH2, CH3, CH4 
“111” = no information 
DRF Direction flag 


“Q” = reverse direction 
“1” = forward direction 


SP Playback speed. This is defined 
by a 3-bit coarse value plus a 4-bit 
fine value. For normal recording, 
it is set to “0100000.” 

GEN Indicates the category of the 

audio source 


Video 


As shown in Table 11.3, IEC 61834 uses 4:1:1 
YCbCr for 720 x 480 video (Figure 3.5) and 
4:2:0 YCbCr for 720 x 576 video (Figure 3.11). 

SMPTE 314M uses 4:1:1 YCbCr (Figure 
3.5) for both video standards for the 25 Mbps 
implementation. 4:2:2 YCbCr (Figure 3.3) is 
used for both video standards for the 50 Mbps 
implementation. 


DCT Blocks 


The Y, Cb, and Cr samples for one frame are 
divided into 8 x 8 blocks, called DCT blocks. 
Each DCT block, with the exception of the 
right-most DCT blocks for Cb and Cr during 
4:1:1 mode, transform 8 samples x 8 lines of 
video data. Rows 1, 3, 5, and 7 of the DCT 


block process field 1, while rows 0, 2, 4, and 6 
process field 2. 

For 480-line systems, there are either 
10,800 (4:2:2) or 8,100 (4:1:1) DCT blocks per 
video frame. 

For 576-line systems, there are either 
12,960 (4:2:2) or 9,720 (4:1:1, 4:2:0) DCT 
blocks per video frame. 


Macroblocks 


As shown in Figure 11.4, each macroblock in 
the 4:2:2 mode consists of four DCT blocks. As 
shown in Figures 11.5 and 11.6, each macro- 
block in the 4:1:1 and 4:2:0 modes consists of 
six DCT blocks. 

For 480-line systems, the macroblock 
arrangement for one frame of 4:1:1 and 4:2:2 
YCbCr data is shown in Figures 11.7 and 11.8, 
respectively. There are either 2,700 (4:2:2) or 
1,350 (4:1:1) macroblocks per video frame. 

For 576-line systems, the macroblock 
arrangement for one frame of 4:2:0, 4:1:1, and 
4:2:2 YCbCr data is shown in Figures 11.9, 
11.10, and 11.11, respectively. There are either 
3,240 (4:2:2) or 1,620 (4:1:1, 4:2:0) macroblocks 
per video frame. 


Super Blocks 


Each super block consists of 27 macroblocks. 

For 480-line systems, the super block 
arrangement for one frame of 4:1:1 and 4:2:2 
YCbCr data is shown in Figures 11.7 and 11.8, 
respectively. There are either 100 (4:2:2) or 50 
(4:1:1) super blocks per video frame. 

For 576-line systems, the super block 
arrangement for one frame of 4:2:0, 4:1:1, and 
4:2:2 YCbCr data is shown in Figures 11.9, 
11.10, and 11.11, respectively. There are either 
120 (4:2:2) or 60 (4:1:1, 4:2:0) super blocks per 
video frame. 
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Compression 


Like MPEG and H.263, DV uses DCT-based 
video compression. However, in this case, DCT 
blocks are comprised from two fields, with 
each field providing samples from 4 scan lines 
and 8 horizontal samples. 

Two DCT modes, called 8-8-DCT and 2-4-8- 
DCT, are available for the transform process, 
depending upon the degree of content varia- 
tion between the two fields of a video frame. 
The 8-8-DCT is your normal 8 x 8 DCT, and is 
used when there a high degree of correlation 
(little motion) between the two fields. The 2-4- 
8-DCT uses two 4 x 8 DCTs (one for each 
field), and is used when there is a low degree 
of correlation (lots of motion) between the two 
fields. Which DCT is used is stored in the DC 
coefficient area using a single bit. 

The DCT coefficients are quantized to 9 
bits, then divided by a quantization number so 


as to limit the amount of data in one video seg- 
ment to five compressed macroblocks. 

Each DCT block is classified into one of 
four classes based on quantization noise and 
maximum absolute values of the AC coeffi- 
cients. The 2-bit class number is stored in the 
DC coefficient area. 

An area number is used for the selection of 
the quantization step. The area number, of 
which there are four, is based on the horizontal 
and vertical frequencies. 

The quantization step is decided by the 
class number, area number, and quantization 
number (QNO). Quantization information is 
passed in the DIF header of video blocks. 

Variable-length coding converts the quan- 
tized AC coefficients to variable-length codes. 

Figures 11.12 and 11.13 illustrate the 
arrangement of compressed macroblocks. 


Parameters 480 Active Lines 576 Active Lines 
active resolution (Y) 720 x 480 720 x 576 
frame refresh rate 29.97 Hz 25 Hz 
YCbCr sampling structure 
IEC 61834 4:1:1 4:2:0 
SMPTE 314M 4:1:1, 4:2:2 4:1:1, 4:2:2 


form of YCbCr coding 


Uniformly quantized PCM, 8 bits per sample. 


active line numbers 


23-262, 285-524 


23-310, 335-622 


Table 11.3. IEC 61834 and SMPTE 314M YCbCr Parameters. 
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CB DCT 3 
CR DCT 2 
¥: 
DCT 0 DCT 1 
LEFT RIGHT 


Figure 11.4. 4:2:2 Macroblock Arrangement. 


CB DCT5 
CR DCT 4 
TOP 
DCT 0 DCT 1 
CB DCT5 
CR DCT 4 
Y 
Y 
DCT O DCT 1 DCT 2 DCT 3 DCT 2 DCT 3 
BOTTOM 
LEFT RIGHT LEFT RIGHT 
EXCEPT FOR RIGHT-MOST MACROBLOCK FOR RIGHT-MOST MACROBLOCK 


Figure 11.5. 4:1:1 Macroblock Arrangement. 


CB DCT 5 
CR DCT 4 

TOP 

DCT O DCT 1 
¥ 

DCT 2 DCT 3 
BOTTOM 

LEFT RIGHT 


Figure 11.6. 4:2:0 Macroblock Arrangement. 
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480 
LINES 


720 SAMPLES 


~< = 
SUPERBLOCK 
(0) 1 2 3 4 
0 S0,0 S0,1 $0,2 S0,3 S0,4 
1 $1,0 $1,1 $1,2 $1,3 $1,4 
2 $2,0 $2,1 $2,2 $2,3 $2,4 
3 $3,0 S3,1 $3,2 $3,3 $3,4 
4 $4,0 $4,1 $4,2 $4,3 $4,4 
5 $5,0 $5,1 $5,2 $5,3 $5,4 
6 S6,0 S6,1 $6,2 S6,3 S6,4 
7 S7,0 S7,1 $7,2 S7,3 S7,4 
8 $8,0 $8,1 $8,2 $8,3 $8,4 
9 $9,0 S9,1 $9,2 S9,3 S9,4 


ef7pele | 


4 


14 


MACROBLOCK 


Figure 11.7. Relationship Between Super Blocks and Macroblocks (4:1:1 YCbCr, 720 x 480). 
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720 SAMPLES 


<= > 
SUPERBLOCK 
0 1 2 3 4 
4 0 80,0 $0,1 $0,2 $0,3 $0,4 
1 $1,0 Sif $1.2 $1,3 $14 
2 $2,0 $21 $2,2 $2,3 $2,4 
3 $3,0 $31 $3,2 $3,3 $3,4 
4 $4,0 $41 $4,2 $43 84,4 
5 $5,0 $5.1 $5,2 $5,3 $5,4 
6 $6,0 $6.1 $6,2 $6,3 86,4 
7 87,0 S71 87,2 87,3 87,4 
8 $8,0 $8.1 $8,2 $8,3 $8,4 
460 9 $9,0 $91 $9,2 $93 89,4 
LINES 10 $10,0 $10,1 $10,2 $10,3 $10,4 
1 $11,0 Sti $11,2 $11,3 $11,4 
12 $12,0 $121 $12,2 $12,3 $12,4 
13 $13,0 $13,1 $13,2 $13,3 $13,4 
14 $14,0 $141 $14,2 $143 $14,4 
15 $15,0 $151 $15,2 $15,3 $15,4 
16 $16,0 $16,1 $16,2 $163 $16.4 
17 $17,0 S171 $17,2 $17,3 S17,4 
18 $18,0 $18,1 $18,2 $18,3 $18,4 
' 19 $19,0 $19,1 $19,2 $19,3 $19,4 


MACROBLOCK 


Figure 11.8. Relationship Between Super Blocks and Macroblocks (4:2:2 YCbCr, 720 x 480). 
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720 SAMPLES 


< > 
SUPERBLOCK 
(0) 1 2 3 4 
4 0 S0,0 S0,1 S0,2 S0,3 S0,4 
1 $1,0 S11 $1,2 $1,3 $1.4 
2 $2,0 $2,1 $2,2 $2,3 $2,4 
3 $3,0 S31 $3,2 $3,3 S3,4 
4 $4,0 S4,1 S$4,2 $4,3 S4,4 
576 5 $5,0 S5,1 $5,2 $5,3 S5,4 
LINES 6 $6,0 S6,1 $6,2 S6,3 S6,4 
7 $7,0 S7,1 $7,2 $7,3 S7,4 
8 $8,0 S8,1 S8,2 $8,3 $8,4 
9 $9,0 S9,1 S9,2 $9,3 S9,4 
10 $10,0 $10,1 $10,2 $10,3 $10,4 
y 11 $11,0 $11,1 $11,2 $11,3 $11,4 
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Figure 11.9. Relationship Between Super Blocks and Macroblocks (4:2:0 YCbCr, 720 x 576). 
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Figure 11.10. Relationship Between Super Blocks and Macroblocks (4:1:1 YCbCr, 720 x 576). 
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Figure 11.11. Relationship Between Super Blocks and Macroblocks (4:2:2 YCbCr, 720 x 576). 
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COMPRESSED MACROBLOCK 
pe 


YO (14 BYTES) 12 BYTES Y1 (14 BYTES) 12 BYTES CR (10 BYTES) CB (10 BYTES) 
ee ~< <4 > ~t <4 >< > 


DCT MODE (1 BIT) 
CLASS NUMBER (2 BITS) 


XO AND X1 DATA AREA = 2 BYTES EACH 


Figure 11.12. 4:2:2 Compressed Macroblock Arrangement. 


COMPRESSED MACROBLOCK 


YO (14 BYTES) Y1 (14 BYTES) Y2 (14 BYTES) Y3 (14 BYTES) CR (10 BYTES) CB (10 BYTES) 
~<t > <4 > <4 > <t > <4 > <4 > 


DCO AC DC1 AC DC2 AC DC3 AC DC4 AC DC5 AC 


DCT MODE (1 BIT) 
CLASS NUMBER (2 BITS) 


Figure 11.13. 4:2:0 and 4:1:1 Compressed Macroblock Arrangement. 
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IEC 61834 D7 D6 D5 D4 D3 D2 D1 DO 

PCO 0 1 1 0 0 0 0 0 
PCl1 TVCH (tens of units, 0-9) TVCH (units, 0-9) 
PC2 B/W EN CLF TVCH (hundreds of units, 0-9) 
PC3 SRC 50/60 ST 
PC4 TUN 

SMPTE 314M D7 D6 D5 D4 D3 D2 D1 DO 
PCO 0 1 1 0 0 0 0 0 
PCl 1 1 1 1 1 1 al 1 
PC2 B/W EN CLF 1 1 1 1 
PC3 1 1 50/60 ST 
PC4 VISC 


Table 11.4. VAUX Source (VS) Pack. 


Video Auxiliary Data (VAUX) 


VAUX information is added to the shuffled 
video data as shown in Figure 11.3. The VAUX 
pack includes a 1-byte pack header and 4 bytes 
of data (payload), resulting in a 5-byte VAUX 
pack. Since there are 45 of them per video 
frame, they are numbered from 0 to 44. A 
VAUX source (VS) pack and an VAUX source 
control (VSC) pack must be included in the 
compressed stream. Only the VS and VSC 
packs are currently supported by SMPTE 
314M, although IEC 61834 supports many 
other pack formats. 


VAUX Source (VS) Pack 

The format for this pack is shown in Table 

11.4. 
TVCH The number of the television 

channel, from 0-999. A value of 

EEEy is reserved for pre- 

recorded tape or a line input. A 

value of FFF y is reserved for “no 

information.” 

B/W _ Black and white flag 

“0” = black and white video 

“1” = color video 
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EN CLF valid flag VAUX Source Control (VSC) Pack 
“0” = CLF is valid The format for this pack is shown in Table 
“1” = CLF is invalid 11.5. 
CLF Color frames identification code CGMS Same as for AAUX 
For 480-line systems: ISR Same as for AAUX 
“00” = color frame A 
“01” = color frame B CMP Same as for AAUX 
“10” = reserved 
“11” = reserved SS Same as for AAUX 
For 576-line systems: RECS Same as for AAUX 
“QO” = 1st, 2nd field 
“01” = 3rd, 4th field REC M Same as for AAUX 
“10” = 5th, 6th field 
“11” = 7th, 8th field BCS Broadcast system. Indicates the 
type information of display 
SRC Defines the input source of the format with DISP 
video signal “00” = type 0 (IEC 61880, EIA- 
608) 
50/60 Sameas for AAUX “01” = type 1 (ETS 300 294) 
“10” = reserved 
ST Same as for AAUX “11” = reserved 
TUN Tuner Category consists of 3-bit DISP Aspect ratio information 
area number and a 5-bit satellite 
number. “11111111” indicates no FF Frame/Field flag. Indicates 
information is available. whether both fields are output in 
order or only one of them is 
VISC output twice during one frame 
period. 


pee = -180 “0” = one field output twice 


“1” = both fields output in order 


“00000000” = 0 

x . FS First/Second flag. Indicates 

ee which field which should be 
01111111” = no information output during field 1 period. 

other values = reserved “0” = field 2 


“1” = field 1 
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FC 


IL 


IEC 61834 D7 D6 D5 D4 D3 D2 D1 DO 
PCO 0 1 1 0 0 0 0 1 
PC1 CGMS ISR CMP SS 
PC2 REC S 1 REC M 1 DISP 
PC3 FF FS FC IL SF Nie BCS 
PC4 1 GEN 

SMPTE 314M D7 D6 D5 D4 D3 D2 D1 DO 
PCO 0 1 1 0 0 0 0 1 
PC1 CGMS 1 1 1 1 1 1 
PC2 1 1 0 0 1 DISP 
PC3 FF FS FC IL 1 1 0 0 
PC4 1 1 1 1 1 1 1 1 


Table 11.5. VAUX Source Control (VSC) Pack. 


Frame change flag. Indicates if 
the picture of the current frame is 
the same picture of the 
immediate previous frame. 

“0” = same picture 

“1” = different picture 


Interlace flag. Indicates if the data 
of two fields which construct one 
frame are interlaced or non- 
interlaced. 

“QO” = noninterlaced 

“1” = interlaced or unrecognized 


SF 


SC 


GEN 


Still-field picture flag. Indicates 
the time difference between the 
two fields within a frame. 

“0” = 0 seconds 

“1” = 1,001/60 or 1/50 second 


Still camera picture flag 
“0” = still camera picture 
“1” = not still camera picture 


Indicates the category of the 
video source 
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Digital Interface 


IEC 61834 and SMPTE 314M both specify the 
data format for a generic digital interface. This 
data format may be sent via IEEE 1394 or 
SDTI, for example. Figure 11.14 illustrates the 
frame data structure. 

Each of the 720 x 480 4:1:1 YCbCr frames 
are compressed to 103,950 bytes. Including 
overhead and audio increases the amount of 
data to 120,000 bytes. 

The compressed 720 x 480 frame is 
divided into 10 DIF (data in frame) sequences. 
Each DIF sequence contains 150 DIF blocks of 
80 bytes each, used as follows: 

135 DIF blocks for video 


9 DIF blocks for audio 


6 DIF blocks used for Header, Subcode, and 
Video Auxiliary (VAUX) information 


Figure 11.14 illustrates the DIF sequence 
structure in detail. Each video DIF block con- 
tains 80 bytes of compressed macroblock data: 


3 bytes for DIF block ID information 


1 byte for the header that includes the quantiza- 
tion number (QNO) and block status (STA) 


14 bytes each for YO, Y1, Y2, and Y3 
10 bytes each for Cb and Cr 


720 x 576 frames may use either the 4:2:0 
YCbCr format (IEC 61834) or the 4:1:1 YCbCr 
format (SMPTE 314M), and require 12 DIF 
sequences. Each 720 x 576 frame is com- 
pressed to 124,740 bytes. Including overhead 
and audio increases the amount of data to 
144,000 bytes, requiring 300 packets to trans- 
fer. 

Note that the organization of data trans- 
ferred over the interface differs from the 
actual DV recording format since error correc- 


tion is not required for digital transmission. In 
addition, although the video blocks are num- 
bered in sequence in Figure 11.15, the 
sequence does not correspond to the left-to- 
right, top-to-bottom transmission of blocks of 
video data. Compressed macroblocks are shuf- 
fled to minimize the effect of errors and aid in 
error concealment. Audio data is also shuffled. 
Data is transmitted in the same shuffled order 
as recorded. 

To illustrate the video data shuffling, DV 
video frames are organized as super blocks, 
with each super block being composed of 27 
compressed macroblocks, as shown in Figures 
11.7 through 11.11. A group of 5 super blocks 
(one from each super block column) make up 
one DIF sequence. Tables 11.6 and 11.7 illus- 
trate the transmission order of the DIF blocks. 

For the 50 Mbps SMPTE 308M format, 
each compressed 720 x 480 or 720 x 576 frame 
is divided into two channels. Each channel 
uses either 10 (480-line systems) or 12 DIF 
sequences (576-line systems). 


IEEE 1394 


Using the IEEE 1394 interface for transferring 
DV information is discussed in Chapter 6. 


SDTI 


The general concept of SDTI is discussed in 
Chapter 6. 


SMPTE 314M Data 

SMPTE 221M details how to transfer SMPTE 
314M DV data over SDTI. Figure 11.16 illus- 
trates the basic implementation. 


IEC 61834 Data 
SMPTE 222M details how to transfer IEC 
61834 DV data over SDTI. 
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1 FRAME IN 1.001/30 SECOND (10 DIF SEQUENCES) 


DIFSO DIFS1 DIFS2 DIFS3 DIFS4 


DIFS5 


DIFS6 


DIFS7 DIFS8 DIFS9 


______| 


~= 


1 DIF SEQUENCE IN 1.001/300 SECOND (150 DIF BLOCKS) 


HEADER 
(1 DIF) 


SUBCODE 
(2 DIF) 


VAUX 
(3 DIF) 


135 VIDEO AND 9 AUDIO DIF BLOCKS 


150 DIF BLOCKS IN 1.001/30 SECOND 


> 


DIFO DIF1 DIF2 DIF3 DIF4 


DIF5 


DIF6 


DIF148 | DIF149 


1 DIF BLOCK IN 1.001 / 45000 SECOND 


ID 
(3 BYTES) 


HEADER 
(1 BYTE) 


DATA 
(76 BYTES) 


YO (14 BYTES) Y1 (14 BYTES) Y2 (14 BYTES) Y3 (14 BYTES) CR (10 BYTES) CB (10 BYTES) 
=< > ~<t > <4 > <4 > <t > <t a 


AC 


DCO i DC2 E 


AC AC AC 


COMPRESSED MACROBLOCK 


<t 


Figure 11.14. Packet Formatting for 25 Mbps 4:1:1 YCbCr 720 x 480 Systems. 


References 


61834-1, Recording—Helical-scan 
digital video cassette recording system using 
6.35mm magnetic tape for consumer use 
(525-60, 625-50, 1125-60 and 1250-50 
systems)—Part 1: General specifications. 

. IEC 61834-2, Recording—Helical-scan 
digital video cassette recording system using 
6.35mm magnetic tape for consumer use 
(525-60, 625-50, 1125-60 and 1250-50 
systems)—Part 2: SD format for 525-60 and 
625-50 systems. 

. IEC 61834-4, Recording—Helical-scan 
digital video cassette recording system using 


. SMPTE 314M, for 


. SMPTE 321M, for 


6.35mm magnetic tape for consumer use 
(525-60, 625-50, 1125-60 and 1250-50 
systems)—Part 4: Pack header table and 
contents. 

Television—Data 
Structure for DV-Based Audio, Data and 
Compressed Video-25 and 50 Mbps. 
Television—Data 
Stream Format for the Exchange of DV- 
Based Audio, Data and Compressed Video 
Over a Serial Data Transport Interface. 


. SMPTE 322M, for Television—Format for 


Transmission of DV Compressed Video, 
Audio and Data Over a Serial Data 
Transport Interface. 


516 = Chapter 11: Consumer DV 
H = HEADER SECTION 
H sco sci vAo VA VA2 SCO, SC1 = SUBCODE SECTION 
VAO, VA1, VA2 = VAUX SECTION 
0 1 2 3 4 5 AO-A8 = AUDIO SECTION 
VO-V134 = VIDEO SECTION 
| AO vo V1 | v2 V3 V4 V5 V6 V7 V8 v9 Vv10 vii vi2 V13 V14 
6 7 8 9 10 11 12 13 14 15 16 a7 18 19 20 21 
>| Al V1i5 Walls} V17 V18 vig Vv20 v21 V22 V23 V24 V25 V26 V27 V28 v29 
22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 
A8& Vv120 vi21 v122 V123 V124 Vv125 V126 V127 V128 v129 V130 V131 V132 V133 V134 
134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 
Figure 11.15. DIF Sequence Detail. 
E 
HEADER FIXED FIXED FIXED FIXED 
7 DATA RACE BLOCK | BLOCK | BLOCK BLOCK erace 
mal SIGNAL TRANSMISSION | DIF BLOCK | DIF BLOCK | DIF BLOCK | DIF BLOCK 
DATA RESERVED ECC 
TYPE 3 WORDS TYPE TYPE ID DATA ID DATA 4 WORDS 
+ WORD 2 WORDS. 1 WORD 3 WORDS | 77 WORDS | 3 WORDS | 77 WORDS 
> 


STREAM BLOCK (170 WORDS) 


Figure 11.16. Transferring DV Data Using SDTI. 
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Video Compressed Video Compressed 
DIF DIF Macroblock DIF DIF Macroblock 
Sequence Block Sequence Block 
Number oc Superblock | Macroblock | Number oc Superblock | Macroblock 
Number Number Number Number Number Number 
0 2:2 0 
1 6,1 0 0 1,2 0 
2 8,3 0 1 5, 1 0 
6 3 0, 0 0 2 7,3 0 
4 4,4 0 3 n-1,0 0 
n-1 
4 3,4 0 
133 0, 0 26 
134 4,4 26 133 n-1,0 26 
0 322 0 134 3,4 26 
1 7,1 0 
2 9,3 0 
3 1,0 0 
1 
4 5, 4 0 
133 1,0 26 
134 5, 4 26 
Notes: 


1. n = 10 for 480-line systems, n = 12 for 576-line systems. 


Table 11.6. Video DIF Blocks and Compressed Macroblocks for 25 Mbps (4:1:1 or 4:2:0 YCbCr). 


518 = Chapter 11: Consumer DV 


Video Compressed Video Compressed 
DIF DIF Macroblock DIF DIF Macroblock 
Sequence Block Sequence Block 
Number oc Superblock | Macroblock Number oc Superblock | Macroblock 
Number Number Number Number Number Number 
0,0 4,2 0 
0,1 5, 2 0 0, 0 2,2 0 
1,0 12,1 0 0,1 3,2 0 
a apea § 13,1 0 1,0 10,1 0 
2,-0 16,3 0 1,1 11,1 0 
n-1 
: 2,0 14,3 0 
134, 0 8,4 26 : 
134, 1 9,4 26 134, 0 6, 4 26 
0,0 6, 2 0 134,1 7,4 26 
0,1 (A 0 
1,0 14,1 0 
a bea § 15,1 0 
1 
2-0 18,3 0 
134, 0 10, 4 26 
134, 1 11,4 26 
Notes: 


1. n = 10 for 480-line systems, n = 12 for 576line systems. 


Table 11.7. Video DIF Blocks and Compressed Macroblocks for 50 Mbps (4:2:2 YCbCr). 


ee 
MPEG 1 


MPEG 1 audio and video compression was 
developed for storing and distributing digital 
audio and video. Features include random 
access, fast forward, and reverse playback. 
MPEG 1 is used as the basis for the original 
video CDs. 

The channel bandwidth and image resolu- 
tion were set by the available media at the time 
(CDs). The goal was playback of digital audio 
and video using a standard compact disc with a 
bit rate of 1.416 Mbps (1.15 Mbps of this is for 
video). 

MPEG 1 is an ISO standard (ISO/IEC 
11172), and consists of six parts: 


system ISO/IEC 11172-1 
video ISO/IEC 11172-2 
audio ISO/IEC 11172-3 


ISO/IEC 13818-3 
ISO/IEC 11172-4 
ISO/IEC 11172-5 


low bit rate audio 
conformance testing 
simulation software 


The bitstreams implicitly define the 
decompression algorithms. The compression 
algorithms are up to the individual manufactur- 
ers, allowing a proprietary advantage to be 
obtained within the scope of an international 
standard. 


MPEG vs. JPEG 


JPEG (ISO/IEC 10918) was designed for still 
continuous-tone grayscale and color images. It 
doesn’t handle bi-level (black and white) 
images efficiently, and pseudo-color images 
have to be expanded into the unmapped color 
representation prior to processing. JPEG 
images may be of any resolution and color 
space, with both lossy and lossless algorithms 
available. 

Since JPEG is such a general purpose stan- 
dard, it has many features and capabilities. By 
adjusting the various parameters, compressed 
image size can be traded against reconstructed 
image quality over a wide range. Image quality 
ranges from “browsing” (100:1 compression 
ratio) to “indistinguishable from the source” 
(about 3:1 compression ratio). Typically, the 
threshold of visible difference between the 
source and reconstructed images is some- 
where between a 10:1 to 20:1 compression ratio. 

JPEG does not use a single algorithm, but 
rather a family of four, each designed for a cer- 
tain application. The most familiar lossy algo- 
rithm is sequential DCT: Either Huffman 
encoding (baseline JPEG) or arithmetic encod- 
ing may be used. When the image is decoded, 
it is decoded left-to-right, top-to-bottom. 
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Progressive DCT is another lossy algo- 
rithm, requiring multiple scans of the image. 
When the image is decoded, a coarse approxi- 
mation of the full image is available right away, 
with the quality progressively improving until 
complete. This makes it ideal for applications 
such as image database browsing. Either spec- 
tral selection, successive approximation, or 
both may be used. The spectral selection 
option encodes the lower-frequency DCT coef- 
ficients first (to obtain an image quickly), fol- 
lowed by the higher-frequency ones (to add 
more detail). The successive approximation 
option encodes the more significant bits of the 
DCT coefficients first, followed by the less sig- 
nificant bits. 

The hierarchical mode represents an 
image at multiple resolutions. For example, 
there could be 512 x 512, 1024 x 1024, and 
2048 x 2048 versions of the image. Higher- 
resolution images are coded as differences 
from the next smaller image, requiring fewer 
bits than they would if stored independently. 
Of course, the total number of bits is greater 
than that needed to store just the highest-res- 
olution image. Note that the individual images 
in a hierarchical sequence may be coded pro- 
gressively if desired. 

Also supported is a lossless spatial algo- 
rithm that operates in the pixel domain as 
opposed to the transform domain. A prediction 
is made of a sample value using up to three 
neighboring samples. This prediction then is 
subtracted from the actual value and the differ- 
ence is losslessly coded using either Huffman 
or arithmetic coding. Lossless operation 
achieves about a 2:1 compression ratio. 

Since video is just a series of still images, 
and baseline JPEG encoders and decoders 
were readily available, people used baseline 
JPEG to compress real-time video (also called 


motion JPEG or MJPEG). However, this tech- 
nique does not take advantage of the frame-to- 
frame redundancies to improve compression, 
as does MPEG. 

Perhaps most important, JPEG is symmet- 
rical, meaning the cost of encoding and decod- 
ing is roughly the same. MPEG, on the other 
hand, was designed primarily for mastering a 
video once and playing it back many times on 
many platforms. To minimize the cost of MPEG 
hardware decoders, MPEG was designed to be 
asymmetrical, with the encoding process 
requiring about 100x the computing power of 
the decoding process. 

Since MPEG 1s targeted for specific applica- 
tions, the hardware usually supports only a few 
specific resolutions. Also, only one color space 
(YCbCr) is supported using 8bit samples. 
MPEG is also optimized for a limited range of 
compression ratios. 

If capturing video for editing, you can use 
either baseline JPEG or [-frame-only (intra 
frame) MPEG to real-time compress to disk. 
Using JPEG requires that the system be able 
to transfer data and access the hard disk at bit 
rates of about 4 Mbps for SIF (Standard Input 
Format) resolution. Once the editing is done, 
the result can be converted into MPEG for 
maximum compression. 


Quality Issues 


At bit rates of about 3-4 Mbps, “broadcast 
quality” is achievable with MPEG 1. However, 
sequences with complex spatial-temporal activ- 
ity (such as sports) may require up to 5-6 
Mbps due to the frame-based processing of 
MPEG 1. MPEG 2 allows similar “broadcast 
quality” at bit rates of about 4-6 Mbps by sup- 
porting field-based processing. 
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Audio Overview 


Several factors affect the quality of MPEG- 
compressed video: 


e the resolution of the original video 
source 


e the bit rate (channel bandwidth) allowed 
after compression 


e motion estimator effectiveness 


One limitation of the quality of the com- 
pressed video is determined by the resolution 
of the original video source. If the original res- 
olution was too low, there will be a general lack 
of detail. 

Motion estimator effectiveness determines 
motion artifacts, such as a reduction in video 
quality when movement starts or when the 
amount of movement is above a certain thresh- 
old. Poor motion estimation will contribute to a 
general degradation of video quality. 

Most importantly, the higher the bit rate 
(channel bandwidth), the more information 
that can be transmitted, allowing fewer motion 
artifacts to be present or a higher resolution 
image to be displayed. Generally speaking, 
decreasing the bit rate does not result in a 
“graceful degradation” of the decoded video 
quality. The video quality rapidly degrades, 
with the 8 x 8 blocks becoming clearly visible 
once the bit rate drops below a given thresh- 
old. 


Audio Overview 


MPEG 1 uses a family of three audio coding 
schemes, called Layer 1, Layer 2, and Layer 3, 
with increasing complexity and sound quality. 
The three layers are hierarchical: a Layer 3 
decoder handles Layers 1, 2, and 3; a Layer 2 
decoder handles only Layers 1 and 2; a Layer 1 
decoder handles only Layer 1. All layers sup- 


port 16-bit digitized audio using 16, 22.05, 24, 
32, 44.1 or 48 kHz sampling rates. 

For each layer, the bitstream format and 
the decoder are specified. The encoder is not 
specified to allow for future improvements. All 
layers work with similar bit rates: 


Layer 1: 32-448 kbps 
Layer 2: 8-384 kbps 
Layer 3: 8-320 kbps 


Two audio channels are supported with 
four modes of operation: 


normal stereo 

joint (intensity and/or ms) stereo 
dual channel mono 

single channel mono 


For normal stereo, one channel carries the left 
audio signal and one channel carries the right 
audio signal. For intensity stereo (supported 
by all layers), high frequencies (above 2 kHz) 
are combined. The stereo image is preserved 
but only the temporal envelope is transmitted. 
For ms stereo (supported by Layer 3 only), 
one channel carries the sum signal (L+R) and 
the other the difference (L-R) signal. In addi- 
tion, pre-emphasis, copyright marks, and origi- 
nal/copy indication are supported. 


Sound Quality 


To determine which layer should be used for a 
specific application, look at the available bit 
rate, as each layer was designed to support 
certain bit rates with a minimum degradation 
of sound quality. 

Layer 1, a simplified version of Layer 2, has 
a target bit rate of about 192 kbps per channel 
or higher. 

Layer 2 is identical to MUSICAM, and has 
a target bit rate of about 128 kbps per channel. 
It was designed as a trade-off between sound 
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quality per bit rate and encoder complexity. It 
is most useful for bit rates around 96-128 kbps 
per channel. 

Layer 3 (also known as “mp3”) merges the 
best ideas of MUSICAM and ASPEC and has a 
target bit rate of about 64 kbps per channel. 
The Layer 3 format specifies a set of advanced 
features that all address a single goal: to pre- 
serve as much sound quality as possible, even 
at relatively low bit rates. 


Background Theory 


All layers use a coding scheme based on psy- 
choacoustic principles—in particular, “mask- 
ing” effects where, for example, a loud tone at 
one frequency prevents another, quieter, tone 
at a nearby frequency from being heard. 
Suppose you have a strong tone with a fre- 
quency of 1000 Hz, and a second tone at 1100 
Hz that is 18 dB lower in intensity. The 1100 
Hz tone will not be heard; it is masked by the 
1000 Hz tone. However, a tone at 2000 Hz 18 
dB below the 1000 Hz tone will be heard. In 
order to have the 1000 Hz tone mask it, the 
2000 Hz tone will have to be about 45 dB down. 
Any relatively weak frequency near a strong 
frequency is masked; the further you get from 
a frequency, the smaller the masking effect. 
Curves have been developed that plot the 
relative energy versus frequency that is 
masked (concurrent masking). Masking 
effects also occur before (premasking) and 
after (postmasking) a strong frequency if there 
is a significant (30-40 dB) shift in level. The 
reason is believed to be that the brain needs 
processing time. Premasking time is about 2-5 
ms; postmasking can last up to 100 ms. 
Adjusting the noise floor reduces the 
amount of needed data, enabling further com- 
pression. CDs use 16 bits of resolution to 
achieve a signal-to-noise ratio (SNR) of about 
96 dB, which just happens to match the 


dynamic range of hearing pretty well (meaning 
most people will not hear noise during 
silence). If 8-bit resolution were used, there 
would be a noticeable noise during silent 
moments in the music or between words. How- 
ever, noise isn’t noticed during loud passages. 
due to the masking effect, which means that 
around a strong sound you can raise the noise 
floor since the noise will be masked anyway. 

For a stereo signal, there usually is redun- 
dancy between channels. All layers may exploit 
these stereo effects by using a “joint stereo” 
mode, with the most flexible approach being 
used by Layer 3. 


Video Overview 


MPEG 1 permits resolutions up to 4095 x 4095 
at 60 frames per second (progressive scan). 
What many people think of as MPEG 1 is a 
subset known as Constrained Parameters Bit- 
stream (CPB). The CPB is a limited set of sam- 
pling and bit rate parameters designed to 
standardize buffer sizes and memory band- 
widths, allowing a nominal guarantee of 
interoperability for decoders and encoders, 
while still addressing the widest possible range 
of applications. Devices not capable of han- 
dling these are not considered to be true 
MPEG 1. Table 12.1 lists some of the con- 
strained parameters. 

The CPB limits video to 396 macroblocks 
(101,376 pixels). Therefore, MPEG 1 video is 
typically coded at SIF resolutions of 352 x 240 
or 352 x 288. During encoding, the original 
BT.601 resolution of 704 x 480 or 704 x 576 is 
scaled down to SIF resolution. This is usually 
done by ignoring field 2 and scaling down field 
1 horizontally. During decoding, the SIF reso- 
lution is scaled up to the 704 x 480 or 704 x 576 
resolution. Note that some entire active scan 
lines and samples on a scan line are ignored to 
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ensure the number of Y samples can be evenly 
divided by 16. Table 12.2 lists some of the 
more common MPEG 1 resolutions. 

The coded video rate is limited to 1.856 
Mbps. However, the bit rate is the most-often 
waived parameter, with some applications 
using up to 6 Mbps or higher. 

MPEG 1 video data uses the 4:2:0 YCbCr 
format shown in Figure 3.7. 


Interlaced Video 


MPEG 1 was designed to handle progressive 
(also referred to as noninterlaced) video. Early 
on, in an effort to improve video quality, sev- 
eral schemes were devised to enable the use of 
both fields of an interlaced picture. 

For example, both fields can be combined 
into a single frame of 704 x 480 or 704 x 576 
resolution and encoded. During decoding, the 
fields are separated. This, however, results in 
motion artifacts due to a moving object being 
in slightly different places in the two fields. 
Coding the two fields separately avoids motion 
artifacts, but reduces the compression ratio 
since the redundancy between fields isn’t used. 

There were many other schemes for han- 
dling interlaced video, so MPEG 2 defined a 
standard way of handling it (covered in Chap- 
ter 13). 


Encode Preprocessing 


Better images can be obtained by preprocess- 
ing the video stream prior to MPEG encod- 
ing. 

To avoid serious artifacts during encoding 
of a particular picture, prefiltering can be 
applied over the entire picture or just in spe- 
cific problem areas. Prefiltering before com- 
pression processing is analogous to anti-alias 


filtering prior to A/D conversion. Prefiltering 
may take into account texture patterns, 
motion, and edges, and may be applied at the 
picture, slice, macroblock, or block level. 

MPEG encoding works best on scenes 
with little fast or random movement and good 
lighting. For best results, foreground lighting 
should be clear and background lighting dif- 
fused. Foreground contrast and detail should 
be normal, but low contrast backgrounds con- 
taining soft edges are preferred. Editing tools 
typically allow you to preprocess potential 
problem areas. 

The MPEG 1 specification has example fil- 
ters for scaling down from BT.601 to SIF reso- 
lution. In this instance, field 2 is ignored, 
throwing away half the vertical resolution, and 
a decimation filter is used to reduce the hori- 
zontal resolution of the remaining scan lines by 
a factor of two. Appropriate decimation of the 
Cb and Cr components must still be carried 
out. 

Better video quality may be obtained by 
deinterlacing prior to scaling down to SIF reso- 
lution. When working on macroblocks (de- 
fined later), if the difference between macro- 
blocks between two fields is small, average 
both to generate a new macroblock. Other- 
wise, use the macroblock area from the field of 
the same parity to avoid motion artifacts. 


Coded Frame Types 


There are four types of coded frames. I (intra) 
frames (~1 bit/pixel) are frames coded as a 
stand-alone still image. They allow random 
access points within the video stream. As such, 
I frames should occur about two times a sec- 
ond. I frames should also be used where scene 
cuts occur. 
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horizontal resolution 


< 768 samples 


vertical resolution 


< 576 scan lines 


picture area 


< 396 macroblocks 


pel rate 


< 396 x 25 macroblocks per second 


picture rate 


< 30 frames per second 


bit rate 


< 1.856 Mbps 


Table 12.1. Some of the Constrained Parameters for MPEG 1. 


Resolution 


Frames per Second 


352 x 240 


29.97 


352 x 240 


23.976 


352 x 288 


25 


320 x 240! 


29.97 


384 x 288! 


25 


Notes: 
1. Square pixel format. 


Table 12.2. Common MPEG 1 Resolutions. 


P (predicted) frames (~0.1 bit/pixel) are 
coded relative to the nearest previous I or P 
frame, resulting in forward prediction pro- 
cessing, as shown in Figure 12.1. P frames 
provide more compression than I frames, 
through the use of motion compensation, and 
are also a reference for B frames and future P 
frames. 

B (bi-directional) frames (~0.015 bit/pixel) 
use the closest past and future I or P frame as a 
reference, resulting in bidirectional predic- 
tion, as shown in Figure 12.1. B frames provide 
the most compression and decrease noise by 
averaging two frames. Typically, there are two 
B frames separating I or P frames. 


D (DC) frames are frames coded as a 
stand-alone still image, using only the DC com- 
ponent of the DCTs. D frames may not be in a 
sequence containing any other frame types 
and are rarely used. 

A group of pictures (GOP) is a series of 
one or more coded frames intended to assist in 
random accessing and editing. The GOP value 
is configurable during the encoding process. 
The smaller the GOP value, the better the 
response to movement (since the I frames are 
closer together), but the lower the compres- 
sion. 
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FRAME DISPLAY 
ORDER 


FORWARD 
PREDICTION 


FRAME TRANSMIT 
ORDER 


INTRA (|) FRAME 


aN 


BI-DIRECTIONAL 
PREDICTION 


BI-DIRECTIONAL (B) FRAME 


[ PREDICTED (P) FRAME 


Figure 12.1. MPEG 1 I, P, and B Frames. Some frames are transmitted out of display sequence, 
complicating the interpolation process, and requiring frame reordering by the MPEG decoder. 


Arrows show inter-frame dependencies. 


In the coded bitstream, a GOP must start 
with an I frame and may be followed by any 
number of I, P or B frames in any order. In dis- 
play order, a GOP must start with an I or B 
frame and end with an I or P frame. Thus, the 
smallest GOP size is a single I frame, with the 
largest size unlimited. 

Originally, each GOP was to be coded and 
displayed independently of any other GOP. 
However, this is not possible unless no B 
frames precede I frames, or if they do, they use 
only backward motion compensation. This 
results in both open and closed GOP formats. 
A closed GOP is a GOP that can be decoded 
without using frames of the previous GOP for 
motion compensation. An open GOP requires 
that they be available. 


Motion Compensation 


Motion compensation improves compression 
of P and B frames by removing temporal 
redundancies between frames. It works at the 
macroblock (defined later) level. 

The technique relies on the fact that within 
a short sequence of the same general image, 
most objects remain in the same location, 
while others move only a short distance. The 
motion is described as a _ two-dimensional 
motion vector that specifies where to retrieve a 
macroblock from a previously decoded frame 
to predict the sample values of the current 
macroblock. 
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After a macroblock has been compressed 
using motion compensation, it contains both 
the spatial difference (motion vectors) and 
content difference (error terms) between the 
reference macroblock and macroblock being 
coded. 

Note that there are cases where informa- 
tion in a scene cannot be predicted from the 
previous scene, such as when a door opens. 
The previous scene doesn’t contain the details 
of the area behind the door. In cases such as 
this, when a macroblock in a P frame cannot be 
represented by motion compensation, it is 
coded the same way as a macroblock in an I 
frame (using intra-picture coding). 

Macroblocks in B frames are coded using 
either the closest previous or future I or P 
frames as a reference, resulting in four possi- 
ble codings: 


e intra coding 
no motion compensation 


¢ forward prediction 
closest previous I or P frame is the 
reference 


e backward prediction 
closest future I or P frame is the 
reference 


¢ bi-directional prediction 
two frames are used as the reference: 
the closest previous I or P frame and 
the closest future I or P frame 


Backward prediction is used to predict “uncov- 
ered” areas that appear in previous frames. 


I Frames 


Image blocks and prediction error blocks have 
a high spatial redundancy. Several steps are 


used to remove this redundancy within a frame 
to improve the compression. The inverse of 
these steps is used by the decoder to recover 
the data. 


Macroblock 

A macroblock (shown in Figure 7.46) consists 
of a 16-sample x 16-line set of Y components 
and the corresponding two 8sample x 8-line 
Cb and Cr components. 

A block is an 8-sample x 8-line set of Y, Cb, 
or Cr values. Note that a Y block refers to one- 
fourth the image size as the corresponding Cb 
or Cr blocks. Thus, a macroblock contains four 
Y blocks, one Cb block, and one Cr block, as 
seen in Figure 12.2. 

There are two types of macroblocks in I 
frames, both using intra coding, as shown in 
Table 12.9. One (called intra-d) uses the cur- 
rent quantizer scale; the other (called intra-q) 
defines a new value for the quantizer scale 

If the macroblock type is intra-q, the mac- 
roblock header specifies a 5-bit quantizer scale 
factor. The decoder uses this to calculate the 
DCT coefficients from the transmitted quan- 
tized coefficients. Quantizer scale factors may 
range from 1-31, with zero not allowed. 

If the macroblock type is intra-d, no quan- 
tizer scale is sent, and the decoder uses the 
current one. 


DCT 
Each 8 x 8 block (of input samples or predic- 
tion error terms) is processed by an 8 x 8 DCT 
(discrete cosine transform), resulting in an 8 x 
8 block of horizontal and vertical frequency 
coefficients, as shown in Figure 7.47. 

Input sample values are 0-255, resulting in 
a range of 0-2,040 for the DC coefficient and a 
range of about —1,000 to +1,000 for the AC coef- 
ficients. 
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EACH MACROBLOCK |S 
16 SAMPLES BY 16 LINES 
(4 Y BLOCKS) 


BLOCK ARRANGEMENT 
WITHIN A MACROBLOCK 


CB DCT5 


CR DCT 4 


EACH Y BLOCK IS 8 
SAMPLES BY 8 LINES 


Figure 12.2. MPEG 1 Macroblocks and Blocks. 


Quantizing 

The 8 x 8 block of frequency coefficients are 
uniformly quantized, limiting the number of 
allowed values. The quantizer step scale is 
derived from the quantization matrix and 
quantizer scale and may be different for dif 
ferent coefficients and may change between 
macroblocks. 

The quantizer step size of the DC coeffi- 
cients is fixed at eight. The DC quantized coef 
ficient is determined by dividing the DC 
coefficient by eight and rounding to the near- 
est integer. AC coefficients are quantized using 
the intra-quantization matrix. 


Zig-Zag Scan 

Zig-zag scanning, starting with the DC compo- 
nent, generates a linear stream of quantized 
frequency coefficients arranged in order of 
increasing frequency, as shown in Figure 7.50. 
This produces long runs of zero coefficients. 


Coding of Quantized DC Coefficients 
After the DC coefficients have been quantized, 
they are losslessly coded. 

Coding of Y blocks within a macroblock fol- 
lows the order shown in Figure 12.2. The DC 
value of block 4 is the DC predictor for block 1 
of the next macroblock. At the beginning of 
each slice, the DC predictor is set to 1,024. 

The DC values of each Cb and Cr block are 
coded using the DC value of the correspond- 
ing block of the previous macroblock as a pre- 
dictor. At the beginning of each slice, both DC 
predictors are set to 1,024. 

The DCT DC differential values are orga- 
nized by their absolute value as shown in Table 
12.16. [size], which specifies the number of 
additional bits to define the level uniquely, is 
transmitted by a variable-length code, and is 
different for Y and CbCr since the statistics are 
different. For example, a size of four is fol- 
lowed by four additional bits. 

The decoder reverses the procedure to 
recover the quantized DC coefficients. 
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Coding of Quantized AC Coefficients 

After the AC coefficients have been quantized, 
they are scanned in the zig-zag order shown in 
Figure 7.50 and coded using run-length and 
level. The scan starts in position 1, as shown in 
Figure 7.50, as the DC coefficient in position 0 
is coded separately. 

The run-lengths and levels are coded as 
shown in Table 12.18. The “s” bit denotes the 
sign of the level; “0” is positive and “1” is nega- 
tive. 

For run-level combinations not shown in 
Table 12.18, an escape sequence is used, con- 
sisting of the escape code (ESC), followed by 
the run-length and level codes from Table 
12.19. 

After the last DCT coefficient has been 
coded, an EOB code is added to tell the 
decoder that there are no more quantized coef- 
ficients in this 8 x 8 block. 


P Frames 


Macroblocks 
There are eight types of macroblocks in P 
frames, as shown in Table 12.10, due to the 
additional complexity of motion compensation. 

Skipped macroblocks are predicted mac- 
roblocks with a zero motion vector. Thus, no 
correction is available; the decoder copies 
skipped macroblocks from the previous frame 
into the current frame. The advantage of 
skipped macroblocks is that they require very 
few bits to transmit. They have no code; they 
are coded by having the macroblock address 
increment code skip over them. 

If the [macroblock quant] column in Table 
12.10 has a “1,” the quantizer scale is transmit- 
ted. For the remaining macroblock types, the 


DCT correction is coded using the previous 
value for quantizer scale. 

If the [motion forward] column in Table 
12.10 has a “1,” horizontal and vertical forward 
motion vectors are successively transmitted. 

If the [coded pattern] column in Table 
12.10 has a “1,” the 6-bit coded block pattern is 
transmitted as a variable-length code. This tells 
the decoder which of the six blocks in the mac- 
roblock are coded (“1”) and which are not 
coded (“0”). Table 12.14 lists the codewords 
assigned to the 63 possible combinations. 
There is no code for when none of the blocks 
are coded; it is indicated by the macroblock 
type. For macroblocks in I frames and for intra- 
coded macroblocks in P and B frames, the 
coded block pattern is not transmitted, but is 
assumed to be a value of 63 (all blocks are 
coded). 

To determine which type of macroblock to 
use, the encoder typically makes a series of 
decisions, as shown in Figure 12.3. 


DCT 
Intra block AC coefficients are transformed in 
the same manner as they are for I frames. Intra 
block DC coefficients are transformed differ- 
ently; the predicted values are set to 1,024, 
unless the previous block was intra-coded. 

Non-intra block coefficients represent dif- 
ferences between sample values rather than 
actual sample values. They are obtained by 
subtracting the motion compensated values of 
the previous frame from the values in the cur- 
rent macroblock. There is no prediction of the 
DC value. 

Input sample values are -255 to +255, 
resulting in a range of about —2,000 to +2,000 
for the AC coefficients. 
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CODED 


QUANT PRED-MCQ 


NON-INTRA 


MOTION 


NOT CODED 


NO QUANT PRED-MC 


PRED-M 


COMPENSATION 


INTRA 


QUANT INTRA-Q 


CODED 


NO QUANT INTRA-D 


QUANT PRED-CQ 


NON-INTRA 


NO MOTION 


NOT CODED 


NO QUANT PRED-C 


SKIPPED 


COMPENSATION 


INTRA 


QUANT INTRA-Q 


NO QUANT INTRA-D 


Figure 12.3. MPEG 1 P Frame Macroblock Type Selection. 


Quantizing 
Intra blocks are quantized in the same manner 
as they are for I frames. 

Non-intra blocks are quantized using the 
quantizer scale and the non-intra quantization 
matrix. The AC and DC coefficients are quan- 
tized in the same manner. 


Coding of Intra Blocks 

Intra blocks are coded the same way as I frame 
intra blocks. There is a difference in the han- 
dling of the DC coefficients in that the pre- 
dicted value is 128, unless the previous block 
was intra coded. 


Coding of Non-Intra Blocks 

The coded block pattern (CBP) is used to 
specify which blocks have coefficient data. 
These are coded similarly to the coding of intra 
blocks, except the DC coefficient is coded in 
the same manner as the AC coefficients. 


B Frames 


Macroblocks 
There are 12 types of macroblocks in B 
frames, as shown in Table 12.11, due to the 
additional complexity of backward motion 
compensation. 

Skipped macroblocks are macroblocks 
having the same motion vector and macro- 
block type as the previous macroblock, which 
cannot be intra coded. The advantage of 
skipped macroblocks is that they require very 
few bits to transmit. They have no code; they 
are coded by having the macroblock address 
increment code skip over them. 

If the [macroblock quant] column in Table 
12.11 has a “1,” the quantizer scale is transmit- 
ted. For the rest of the macroblock types, the 
DCT correction is coded using the previous 
value for the quantizer scale. 
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FORWARD MC 


BACKWARD MC 


INTERPOLATED MC A 


CODED 


QUANT PRED-*CQ 


NON-INTRA 


NOT CODED 


NO QUANT PRED-*C 


PRED-* OR SKIPPED 


INTRA 


QUANT INTRA-Q 


NO QUANT INTRA-D 


Figure 12.4. MPEG 1 B Frame Macroblock Type Selection. 


If the [motion forward] column in Table 
12.11 has a “1,” horizontal and vertical forward 
motion vectors are successively transmitted. If 
the [motion backward] column in Table 12.11 
has a “1,” horizontal and vertical backward 
motion vectors are successively transmitted. If 
both forward and backward motion types are 
present, the vectors are transmitted in this 
order: 


horizontal for ward 
vertical for ward 
horizontal backward 
vertical backward 


If the [coded pattern] column in Table 
12.11 has a “1,” the 6-bit coded block pattern is 
transmitted as a variable-length code. This tells 
the decoder which of the six blocks in the mac- 
roblock are coded (“1”) and which are not 
coded (“0”). Table 12.14 lists the codewords 
assigned to the 63 possible combinations. 
There is no code for when none of the blocks 


are coded; this is indicated by the macroblock 
type. For macroblocks in I frames and for intra- 
coded macroblocks in P and B frames, the 
coded block pattern is not transmitted, but is 
assumed to be a value of 63 (all blocks are 
coded). 

To determine which type of macroblock to 
use, the encoder typically makes a series of 
decisions, shown in Figure 12.4. 


Coding 

DCT coefficients of blocks are transformed 
into quantized coefficients and coded in the 
same way they are for P frames. 


D Frames 


D frames contain only DC-frequency data and 
are intended to be used for fast visible search 
applications. The data contained in a D frame 
should be just sufficient for the user to locate 
the desired video. 
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Video Bitstream 


Figure 12.5 illustrates the video bitstream, a 
hierarchical structure with seven layers. From 
top to bottom the layers are: 


Video Sequence 
Sequence Header 

Group of Pictures (GOP) 
Picture 

Slice 

Macroblock (MB) 
Block 


Video Sequence 


Sequence_end_code 
This 32-bit field has a value of 000001B7y and 
terminates a video sequence. 


Sequence Header 


Data for each sequence consists of a sequence 
header followed by data for group of pictures 
(GOPs). The structure is shown in Figure 12.5. 


Sequence_header_code 
This 32-bit field has a value of 000001B3y and 
indicates the beginning of a sequence header. 


Horizontal_size 

This 12-bit binary value specifies the width of 
the viewable portion of the Y component. The 
width in macroblocks is defined as 
(horizontal_size + 15)/16. 


Ver tical_size 

This 12-bit binary value specifies the height of 
the viewable portion of the Y component. The 
height in macroblocks is defined as 
(vertical_size + 15)/16. 


Pel_aspect_ratio 
This 4-bit codeword indicates the pixel aspect 
ratio, as shown in Table 12.3. 


Picture_rate 
This 4-bit codeword indicates the frame rate, 
as shown in Table 12.4. 


Bit_rate 

An 18-bit binary value specifying the bitstream 
bit rate, measured in units of 400 bps rounded 
upwards. A zero value is not allowed; a value of 
3FFFFy specifies variable bit rate operation. If 
constrained_parameters_flag is a “1,” the bit 
rate must be <1.856 Mbps. 


Marker_bit 
Always a “1.” 


Vbv_buffer_size 

This 10-bit binary number specifies the mini- 
mum size of the video buffering verifier 
needed by the decoder to properly decode the 
sequence. It is defined as: 


B= 16 x 1024 x vbv_buffer_size 


If the constrained_parameters_flag bit is a “1,” 
the vbv_buffer_size must be <40 kB. 


Constrained_parameters_flag 
This bit is set to a “1” if the following con- 
straints are met: 


horizontal_size < 768 samples 

vertical_size < 576 lines 

((horizontal_size + 15)/16) x ((vertical_size + 
15) /16) < 396 

((horizontal_size + 15)/16) x ((vertical_size + 
15) /16) x picture_rate < 396*25 

picture_rate < 30 frames per second 

forward_f_code <4 

backward_f_code <4 
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SEQUENCE HEADER 


GOP LAYER 


PICTURE LAYER 


SLICE LAYER 


MB LAYER 


BLOCK LAYER 


SEQUENCE | Horizontal | vertical | aspect | PicTURE BIT nhs CONSTRAINED 
HEADER BUFFER | PARAMETERS 
SIZE SIZE RATIO. RATE RATE 
CODE SIZE FLAG 
INTRA NON-INTRA | EXTENSION | SEQUENCE USER DATA GigE 
L_, QUANTIZER QUANTIZER START EXTENSION START ee Gop | +++] Gop 
MATRIX MATRIX CODE DATA CODE 
GROUP ae ctosep | Broken | EXTENSION GROUP USER DATA igen 
START START EXTENSION START PICTURE | +++ | PICTURE 
CODE GOP LINK DATA 
CODE CODE DATA CODE 
PICTURE PICTURE FULL PEL FULL PEL 
TEMPORAL VBV FORWARD BACKWARD 
START CODING FORWARD BACKWARD 
REFERENCE DELAY F CODE F CODE 
CODE TYPE VECTOR VECTOR 
EXTRA EXTENSION PICTURE USER DATA pees 
L_| INFORMATION START EXTENSION START BAA SLICE see | SLICE 
PICTURE CODE DATA CODE 
SLICE EXTRA 
QUANTIZER 
START INFORMATION | MB | e+ | MB 
SCALE 
CODE SLICE 
MB MOTION MOTION MOTION MOTION 
MB MB MB | QUANTIZER 
ADDRESS HORIZONTAL VERTICAL | HORIZONTAL VERTICAL — 
STUFFING | ESCAPE TYPE SCALE 
INCREMENT FORWARD FORWARD BACKWARD | BACKWARD 
CODED END 
L_ BLOCK Bt see BG OF 
PATTERN MB 
DCT DC DCT DC DCT DC DCT DC DCT DC DCT DC END 
SIZE SIZE SIZE SIZE COEFFICIENT | COEFFICIENT | OF 
LUMINANCE | DIFFERENTIAL | CHROMINANCE | DIFFERENTIAL FIRST NEXT BLOCK 


Figure 12.5. MPEG 1 Video Bitstream Layer Structures. Marker and reserved bits not shown. 


Video Bitstream 
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Height / Width Example Aspect Ratio Code 
forbidden 0000 
1.0000 square pixel 0001 
0.6735 0010 
0.7031 625-line 16:9 0011 
0.7615 0100 
0.8055 0101 
0.8437 525-line 16:9 0110 
0.8935 0111 
0.9157 625-line BT.601 1000 
0.9815 1001 
1.0255 1010 
1.0695 1011 
1.0950 525-line BT.601 1100 
1.1575 1101 
1.2015 1110 
reserved 1111 


Table 12.3. MPEG 1 pel_aspect_ratio Codewords. 


Frames Picture 
Per Second Rate Code 
forbidden 0000 
24/1.001 0001 
24 0010 
25 0011 
30/1.001 0100 
30 0101 
50 0110 
60/1.001 0111 
60 1000 
reserved 1001 
reserved 1010 
reserved 1011 
reserved 1100 
reserved 1101 
reserved 1110 
reserved 1111 


Table 12.4. MPEG 1 picture_rate Codewords. 
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Load_intra_quantizer_matrix 

This bit is set to a “1” if intra_quantizer_matrix 
follows. If set to a “0,” the default values below 
are used until the next occurrence of a 
sequence header. 


Intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the current intra quantizer values. A 
value of zero is not allowed. The value for 
intra_quant [0, 0] is always eight. These values 
take effect until the next occurrence of a 
sequence header. 


Load_non_intra_quantizer_matrix 

This bit is set to a “ll” if 
non_intra_quantizer_matrix follows. If set to a 
“0,” the default values below are used until the 
next occurrence of a sequence header. 


Non_intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the current non-intra quantizer values. 
A value of zero is not allowed. These values 
take effect until the next occurrence of a 
sequence header. 


Extension_start_code 

This optional 32-bit string of 000001B5y, indi- 
cates the beginning of sequence_extension_data. 
sequence_extension_data continues until the 
detection of another start code. 


Sequence_extension_data 
These n x 8 bits are present only if 
extension_start_code is present. 


User_data_start_code 

This optional 32-bit string of 000001B2y, indi- 
cates the beginning of user_data. user_data 
continues until the detection of another start 
code. 


User_data 

These n x 8 bits are present only if 
user_data_start_code is present. user_data 
must not contain a string of 23 or more consec- 
utive zero bits. 


Group of Pictures (GOP) Layer 


Data for each group of pictures consists of a 
GOP header followed by picture data. The 
structure is shown in Figure 12.5. 


Group_start_code 
This 32-bit value of 000001B8y indicates the 
beginning of a group of pictures. 
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7 Range Number of 
Hime:Coge of eine Bits 
drop_frame_flag 1 
time_code_hours 0-23 5 
time_code_minutes 0-59 6 
marker_bit 1 1 
time_code_seconds 0-59 6 
time_code_pictures 0-59 6 


Table 12.5. MPEG 1 time_code Field. 


Time_code 

These 25 bits indicate timecode information, as 
shown in Table 12.5. [drop_frame_flag] may 
be set to “1” only if the picture rate is 30/1.001 
(29.97) Hz. 


Closed_gop 

This 1-bit flag is set to “1” if the group of pic- 
tures has been encoded without motion vec- 
tors referencing the previous group of 
pictures. This bit allows support of editing the 
compressed bitstream. 


Broken_link 

This 1-bit flag is set to a “O” during encoding. It 
is set to a “1” during editing when the B frames 
following the first I frame of a group of pictures 
cannot be correctly decoded. 


Extension_start_code 

This optional 32-bit string of 000001B5y, indi- 
cates the beginning of group_extension_data. 
group_extension_data continues until the detec- 
tion of another start code. 


Group_extension_data 
These n x 8 bits are present only if 
extension_start_code is present. 


User_data_start_code 

This optional 32-bit string of 000001B2y, indi- 
cates the beginning of user_data. user_data 
continues until the detection of another start 
code. 


User_data 

These n x 8 bits are present only if 
user_data_start_code is present. user_data 
must not contain a string of 23 or more consec- 
utive zero bits. 


Picture Layer 


Data for each picture layer consists of a picture 
header followed by slice data. The structure is 
shown in Figure 12.5. 


Picture_start_code 
This has a 32-bit value of 00000100q. 
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Temporal_reference 

For the first frame in display order of each 
group of pictures, the temporal_reference value 
is zero. This 10-bit binary value then incre- 
ments by one, modulo 1024 for each frame in 
display order. 


Picture_coding_type 

This 3-bit codeword indicates the frame type (I 
frame, P frame, B frame, or D frame), as shown 
in Table 12.6. D frames are not to be used in 
the same video sequence as other frames. 


nag Code 
forbidden 000 
I frame 001 
P frame 010 
B frame 011 
D frame 100 
reserved 101 
reserved 110 
reserved 111 


Table 12.6. MPEG 1 picture_coding_type 
Code. 


Vbv_delay 

For constant bit rates, the 16-bit vbv_delay 
binary value sets the initial occupancy of the 
decoding buffer at the start of decoding a pic- 
ture so that it doesn’t overflow or underflow. 
For variable bit rates, vbv_delay has a value of 
FFFFy. 


Full_pel_forward_vector 

This 1-bit flag is present if picture_coding_type 
is “010” (P frames) or “011” (B frames). If a “1,” 
the forward motion vectors are based on inte- 
ger samples, rather than half-samples. 


Forward_f_code 

This 3-bit binary number is present if 
picture_coding_type is “010” (P frames) or 
“011” (B frames). Values of “001” to “111” are 
used; a value of “000” is forbidden. 

Two parameters used by the decoder to 
decode the forward motion vectors are derived 
from this field: forward_r_size and forward_f. 
forward_r_size is one less than forward_f_code. 
forward_fis defined in Table 12.7. 


Forward F Forward F 
Code Value 
001 1 
010 2 
011 4 
100 8 
101 16 
110 32 
111 64 


Table 12.7. MPEG 1 forward_f_code 
Values. 


Full_pel_backward_vector 

This 1-bit flag is present if picture_coding_type 
is “O11” (B frames). If a “1,” the backward 
motion vectors are based on integer samples, 
rather than half-samples. 


Backward_f_code 

This 3-bit binary number is present if 
picture_coding_type is “011” (B frames). Values 
of “001” to 111” are used; a value of “000” is for- 
bidden. 

Two parameters used by the decoder to 
decode the backward motion vectors are 
derived from this field: backward_r_size and 
backward_f. backward_r_size is one less than 
backward_f_code. backward_f is defined the 
same as forward_f. 
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Extra_bit_picture 
A bit which, when set to “1,” indicates that 
extra_information_picture follows. 


Extra_information_picture 

If extra_bit_picture = “1,” then these 9 bits fol- 
low consisting of 8 bits of data 
(extra_information_picture) and then another 
extra_bit_picture to indicate if a further 9 bits 
follow, and so on. 


Extension_start_code 

This optional 32-bit string of 000001B5y, indi- 
cates the beginning of picture_extension_data. 
picture_extension_data continues until the 
detection of another start code. 


Picture_extension_data 
These n x 8 bits are present only if 
extension_start_code is present. 


User_data_start_code 

This optional 32-bit string of 000001B2y, indi- 
cates the beginning of user_data. user_data 
continues until the detection of another start 
code. 


User_data 

These n x 8 bits are present only if 
user_data_start_code is present. User data 
must not contain a string of 23 or more consec- 
utive zero bits. 


Slice Layer 


Data for each slice layer consists of a slice 
header followed by macroblock data. The 
structure is shown in Figure 12.5. 


Slice_start_code 
The first 24 bits of this 32-bit field have a value 
of 000001y. The last 8 bits are the 
slice_vertical_position, and have a value of 01q- 
AFy. 

slice_vertical_position specifies the verti- 
cal position in macroblock units of the first 
macroblock in the slice. The value of the first 
row of macroblocks is one. 


Quantizer_scale 

This 5-bit binary number has a value of 1-31 (a 
value of 0 is forbidden). It specifies the scale 
factor of the reconstruction level of the DCT 
coefficients. The decoder uses this value until 
another quantizer_scale is received at the 
either the slice or macroblock layer. 


Extra_bit_slice 
A bit which, when set to “1,” indicates that 
extra_information_slice follows. 


Extra_information_slice 

If extra_bit_slice = “1,” then these 9 bits follow 
consisting of 8 bits of data 
(extra_information_slice) and then another 
extra_bit_slice to indicate if a further 9 bits fol- 
low, and so on. 
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Macroblock (MB) Layer 


Data for each macroblock layer consists of a 
macroblock header followed by motion vectors 
and block data. The structure is shown in Fig- 
ure 12.5. 


Macroblock_stuffing 

This optional 11-bit field is a fixed bit string of 
“0000 0001 111” and may be used to increase 
the bit rate to match the storage or transmis- 
sion requirements. Any number of consecutive 
macroblock_stuffing fields may be used. 


Macroblock_escape 

This optional 11-bit field is a fixed bit string of 
“0000 0001 000” and is used when the differ- 
ence between the current macroblock address 
and the previous macroblock address is 
greater than 33. It forces the value of 
macroblock_address_increment to be increased 
by 33. Any number of consecutive 
macroblock_escape fields may be used. 


Macroblock_address_increment 

This is a variable-length codeword that speci- 
fies the difference between the current mac- 
roblock address and the previous macroblock 
address. It has a maximum value of 33. Values 
greater than 33 are encoded using the 
macroblock_escape field. The variable-length 
codes are listed in Table 12.8. 


Macroblock_type 

This is a variable-length codeword that speci- 
fies the coding method and macroblock con- 
tent. The variable-length codes are listed in 
Tables 12.9 through 12.12. 


Quantizer_scale 

This optional 5-bit binary number has a value 
of 1-31 (a value of 0 is forbidden). It specifies 
the scale factor of the reconstruction level of 
the received DCT coefficients. The decoder 
uses this value until another quantizer_scale is 
received at the either the slice or macroblock 
layer. The quantizer_scale field is present only 
when [macroblock quant] = “1” in Tables 12.9 
through 12.12. 


Motion_horizontal_for ward_code 

This optional variable-length codeword con- 
tains forward motion vector information as 
defined in Table 12.13. It is present only when 
[motion forward] = “1” in Tables 12.9 through 
12.12. 


Motion_horizontal_forward_r 

This optional binary number (of forward_r_size 
bits) is used to help decode the forward 
motion vectors. It is present only when 
[motion forward] = “1” in Tables 12.9 through 
12.12, forward_fcode # “O01,” and 
motion_horizontal_forward_code # “0.” 


Motion_vertical_forward_code 

This optional variable-length codeword con- 
tains forward motion vector information as 
defined in Table 12.13. It is present only when 
[motion forward] = “1” in Tables 12.9 through 
12.12. 


Motion_vertical_forward_r 

This optional binary number (of forward_r_size 
bits) is used to help decode the forward 
motion vectors. It is present only when 
[motion forward] = “1” in Tables 12.9 through 
12.12, forward_fcode # “O01,” and 
motion_vertical_forward_code # “0.” 
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Table 12.8. MPEG 1 Variable-Length Code Table for macroblock_address_increment. 


ee | cee | eee | een 
1 1 17 0000 0101 10 
2 011 18 0000 0101 01 
3 010 19 0000 0101 00 
4 0011 20 0000 0100 11 
5 0010 21 0000 0100 10 
6 0001 1 22 0000 0100 011 
7 0001 0 23 0000 0100 010 
8 0000 111 24 0000 0100 001 
9 0000 110 25 0000 0100 000 
10 0000 1011 26 0000 0011 111 
11 0000 1010 27 0000 0011 110 
12 0000 1001 28 0000 0011 101 
13 0000 1000 29 0000 0011 100 
14 0000 0111 30 0000 0011 011 
15 0000 0110 31 0000 0011 010 
16 0000 0101 11 32 0000 0011 001 
33 0000 0011 000 


Macroblock | Macroblock Motion Motion Coded Intra Code 
Type Quant Forward Backward Pattern Macroblock 

intra-d 0 0 0 0 1 1. 

intra-q 1 0 0 0 1 01 


Table 12.9. MPEG 1 Variable-Length Code Table for macroblock_type for | Frames. 


Macroblock | Macroblock Motion Motion Coded Intra Code 
Type Quant Forward Backward Pattern Macroblock 
pred-mc 0 1 0 1 0 1 
pred-c 0 0 0 1 0 01 
pred-m 0 1 0 0 0 001 
intra-d 0 0 0 0 1 0001 1 
pred-mcq 1 1 0 1 0 0001 0 
pred-cq 1 0 0 1 0 0000 1 
intra-q 1 0 0 0 1 0000 01 
skipped 


Table 12.10. MPEG 1 Variable-Length Code Table for macroblock_type for P Frames. 
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Macroblock | Macroblock Motion Motion Coded Intra Code 
Type Quant Forward Backward Pattern Macroblock 
pred-i 0 1 1 0 0 10 
pred-ic 0 1 1 1 0 11 
pred-b 0 0 1 0 0 010 
intra-be 0 0 1 1 0 011 
pred-f 0 1 0 0 0 0010 
pred-fc 0 1 0 1 0 0011 
intra-d 0 0 0 0 1 0001 1 
pred-icq 1 1 1 1 0 0001 0 
pred-fcq 1 1 0 1 0 0000 11 
pred-bcq 1 0 1 1 0 0000 10 
intra-q 1 0 0 0 1 0000 01 
skipped 


Table 12.11. MPEG 1 Variable-Length Code Table for macroblock_type for B Frames. 


Macroblock Motion Motion Coded Intra Code 
Quant Forward Backward Pattern Macroblock 
0 0 0 0 1 1 


Table 12.12. MPEG 1 Variable-Length Code Table for macroblock_type for D Frames. 
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Motion Vector Code Motion Vector Code 
Difference Difference 

-16 0000 0011 001 1 010 

-15 0000 0011 011 2 0010 

-14 0000 0011 101 3 0001 0 

-13 0000 0011 111 4 0000 110 

-12 0000 0100 001 5 0000 1010 

-11 0000 0100 011 6 0000 1000 

-10 0000 0100 11 7 0000 0110 
-9 0000 0101 01 8 0000 0101 10 
-8 0000 0101 11 9 0000 0101 00 
-7 0000 0111 10 0000 0100 10 
-6 0000 1001 11 0000 0100 010 
-5 0000 1011 12 0000 0100 000 
-4 0000 111 13 0000 0011 110 
-3 0001 1 14 0000 0011 100 
-2 0011 15 0000 0011 010 
-1 011 16 0000 0011 000 

0 1 


Table 12.13. MPEG 1 Variable-Length Code Table for 
motion_horizontal_forward_code, motion_vertical_forward_code, 
motion_horizontal_backward_code, and motion_vertical_backward_code. 
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Motion_horizontal_backward_code 

This optional variable-length codeword con- 
tains backward motion vector information as 
defined in Table 12.13. It is present only when 
[motion backward] = “1” in Tables 12.9 
through 12.12. 


Motion_horizontal_backward_r 

This optional binary number (of 
backward_r_size bits) is used to help decode 
the backward motion vectors. It is present only 
when [motion backward] = “1” in Tables 12.9 
through 12.12, backward_f_code # “001,” and 
motion_horizontal_backward_code # “0.” 


Motion_vertical_backward_code 

This optional variable-length codeword con- 
tains backward motion vector information as 
defined in Table 12.13. The decoded value 
helps decide if motion_vertical_backward_r 
appears in the bitstream. This parameter is 
present only when [motion backward] = “1” in 
Tables 12.9 through 12.12. 


Motion_vertical_backward_r 

This optional binary number (of 
backward_r_size bits) is used to help decode 
the backward motion vectors. It is present only 
when [motion backward] = “1” in Tables 12.9 
through 12.12, backward_f_code # “001,” and 
motion_vertical_backward_code # “0.” 


Coded_block_pattern 

This optional variable-length codeword is used 
to derive the coded block pattern (CBP) as 
shown in Table 12.14. It is present only if 
[coded pattern] = “1” in Tables 12.9 through 
12.12, and indicates which blocks in the mac- 
roblock have at least one transform coefficient 
transmitted. The coded block pattern number 
is represented as: 


PoP PoP3P4P5 


where P,, = “1” for any coefficient present for 
block [n], else P, = “0.” Block numbering is 
given in Figure 12.2. 


End_of_macroblock 
This optional 1-bit field has a value of “1.” It is 
present only for D frames. 


Block Layer 


Data for each block layer consists of coeffi- 
cient data. The structure is shown in Figure 
12.5. 


Dct_dc_size_luminance 

This optional variable-length codeword is used 
with intra-coded Y blocks. It specifies the num- 
ber of bits used for dct_dc_differential. The 
variable-length codewords are shown in Table 
12.15. 
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Coded Coded Coded 
Block Code Block Code Block Code 
Pattern Pattern Pattern 
60 111 9 0010 110 43 0001 0000 
4 1101 17 0010 101 25 0000 1111 
8 1100 33 0010 100 37 0000 1110 
16 1011 6 0010 011 26 0000 1101 
32 1010 10 0010 010 38 0000 1100 
12 10011 18 0010 001 29 0000 1011 
48 1001 0 34 0010 000 45 0000 1010 
20 1000 1 7 0001 1111 53 0000 1001 
40 1000 0 11 0001 1110 57 0000 1000 
28 01111 19 0001 1101 30 0000 0111 
44 01110 35 0001 1100 46 0000 0110 
52 0110 1 13 0001 1011 54 0000 0101 
56 01100 49 0001 1010 58 0000 0100 
1 0101 1 21 0001 1001 31 0000 0011 1 
61 0101 0 41 0001 1000 47 0000 0011 0 
2 0100 1 14 0001 0111 55 0000 0010 1 
62 0100 0 50 0001 0110 59 0000 0010 0 
24 0011 11 22 0001 0101 27 0000 0001 1 
36 0011 10 42 0001 0100 39 0000 0001 0 
3 0011 01 15 0001 0011 
63 0011 00 51 0001 0010 
5 0010 111 23 0001 0001 


Table 12.14. MPEG 1 Variable-Length Code Table for coded_block_pattern. 


DCT DC DCT DC 
Size Code Size Code 
Luminance Luminance 

0) 100 5 1110 
1 00 6 11110 
2 01 7 1111 10 
3 101 8 1111 110 
4 110 


Table 12.15. MPEG 1 Variable-Length Code Table for dct_dc_size_luminance. 
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Dct_dc_differential 
This optional variable-length codeword is 
present after dct_dc_size_luminance _ if 
dct_dc_size_luminance # “0.” The values are 
shown in Table 12.16. 


Dct_dc_size_chrominance 

This optional variable-length codeword is used 
with intra-coded Cb and Cr blocks. It specifies 
the number of bits used for dct_dc_differential. 
The variable-length codewords are shown in 
Table 12.17. 


Dct_dc_differential 
This optional variable-length codeword is 
present after dct_dc_size_chrominance _ if 
dct_dc_size_chrominance # “0.” The values are 
shown in Table 12.16. 


Det_coefficient_first 

This optional variable-length codeword is used 
for the first DCT coefficient in non-intra-coded 
blocks, and is defined in Tables 12.18 and 
12.19. 


Det_coefficient_next 

Up to 63 optional variable-length codewords 
present only for I, P and B frames. They are 
the DCT coefficients after the first one, and are 
defined in Tables 12.18 and 12.19. 


End_of_block 

This 2-bit value is used to indicate that no addi- 
tional non-zero coefficients are present. The 
value of this parameter is “10.” 
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DCT DC Differential Size nd cnen ai, 
-255 to -128 8 1111110 11111110 00000000 to 01111111 
-127 to -64 7 111110 1111110 0000000 to 0111111 
-63 to -32 6 11110 111110 000000 to 011111 

-31 to -16 5 1110 11110 00000 to 01111 
-15 to -8 4 110 1110 0000 to 0111 
-7 to -4 3 101 110 000 to 011 
-3 to -2 2 01 10 00 to 01 
Zi. 1 00 01 0 
0 0 100 00 
1 1 00 01 1 
2to3 2 01 10 10 to 11 
4 to 7 3 101 110 100 to 111 
8 to 15 4 110 1110 1000 to 1111 
16 to 31 5 1110 11110 10000 to 11111 
32 to 63 6 11110 111110 100000 to 111111 
64 to 127 7 111110 1111110 1000000 to 1111111 
128 to 255 8 1111110 11111110 10000000 to 11111111 


Table 12.16. MPEG 1 Variable-Length Code Table for dct_dc_differential. 


DCT DC DCT DC 
Size Code Size Code 
Chrominance Chrominance 

0) 00 5 11110 
1 01 6 1111 10 
2 10 7 1111 110 
3 110 8 1111 1110 
4 1110 


Table 12.17. MPEG 1 Variable-Length Code Table for dct_dc_size_chrominance. 
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Run Level Code Run Level Code 

end_of_block 10 escape 0000 01 

O (note 2) 1 1s 0 5 0010 0110s 

O (note 3) 1 lls 0 6 0010 0001 s 
1 1 Olls 1 3 0010 0101s 
0 2 0100 s 3 2 0010 0100s 
2 1 0101s 10 1 0010 0111s 
0 3 0010 1s 11 1 0010 0011s 
3 1 0011 1s 12 1 0010 0010s 
4 1 0011 0s 13 1 0010 0000 s 
1 2 0001 10 s 0 7 0000 0010 10s 
5 1 0001 11s 1 4 0000 0011 00 s 
6 1 0001 01 s 2 3 0000 0010 11s 
7 1 0001 00 s 4 2 0000 0011 11s 
0 4 0000 110 s 5 2 0000 0010 01s 
2 2 0000 100 s 14 1 0000 0011 10s 
8 1 0000 111s 15 1 0000 0011 01s 
9 1 0000 101 s 16 1 0000 0010 00 s 


Notes: 

1. s = sign of level; “0” for positive; s = “1” for negative. 
2: Used for dct_coefficient_first 

3: Used for dct_coefficient_next. 


Table 12.18a. MPEG 1 Variable-Length Code Table for dct_coefficient_first and 
dct_coefficient_next. 
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Run Level Code Run Level Code 
0 8 0000 0001 1101s 0 12 0000 0000 1101 0s 
0 9 0000 0001 1000 s (0) 13 0000 0000 1100 1s 
0 10 0000 0001 0011s 0 14 0000 0000 1100 0s 
0 11 0000 0001 0000 s (0) 15 0000 0000 1011 1s 
1 5 0000 0001 1011s 1 6 0000 0000 1011 0s 
2 4 0000 0001 0100 s 1 7 0000 0000 1010 1s 
3 3 0000 0001 1100s 2 5 0000 0000 1010 0s 
4 3 0000 0001 0010 s 3 4 0000 0000 1001 1s 
6 2 0000 0001 1110s 5 3 0000 0000 1001 0s 
7 2 0000 0001 0101 s 9 2 0000 0000 1000 1 s 
8 2 0000 0001 0001 s 10 2 0000 0000 1000 0s 
17 1 0000 0001 1111s 22 1 0000 0000 1111 1s 
18 1 0000 0001 1010s 23 1 0000 0000 1111 0s 
19 1 0000 0001 1001 s 24 1 0000 0000 1110 1s 
20 1 0000 0001 0111s 25 1 0000 0000 1110 0s 
21 1 0000 0001 0110s 26 1 0000 0000 1101 1s 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 12.18b. MPEG 1 Variable-Length Code Table for dct_coefficient_first and 


dct_coefficient_next. 
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Run Level Code Run Level Code 
0 16 0000 0000 011111 s 0 40 0000 0000 0010 000 s 
0 17 0000 0000 011110 s 1 8 0000 0000 0011 111s 
0 18 0000 0000 0111 01s 1 9 0000 0000 0011 110s 
0 19 0000 0000 011100 s 1 10 0000 0000 0011 101 s 
(0) 20 0000 0000 011011 s 1 11 0000 0000 0011 100s 
0 21 0000 0000 0110 10 s 1 12 0000 0000 0011 011s 
0 22 0000 0000 0110 01s 1 13 0000 0000 0011 010s 
0 23 0000 0000 0110 00 s 1 14 0000 0000 0011 001 s 
0 24 0000 0000 0101 11s 1 15 0000 0000 0001 0011 s 
0 25 0000 0000 0101 10 s 1 16 0000 0000 0001 0010 s 
0 26 0000 0000 0101 01 s 1 17 0000 0000 0001 0001 s 
0 27 0000 0000 0101 00 s 1 18 0000 0000 0001 0000 s 
0 28 0000 0000 0100 11 s 6 3 0000 0000 0001 0100 s 
0 29 0000 0000 0100 10 s 11 2 0000 0000 0001 1010 s 
0 30 0000 0000 0100 01 s 12 2 0000 0000 0001 1001 s 
0 31 0000 0000 0100 00 s 13 2 0000 0000 0001 1000 s 
0 32 0000 0000 0011 000 s 14 2 0000 0000 0001 0111s 
0 33 0000 0000 0010 111s 15 2 0000 0000 0001 0110 s 
0 34 0000 0000 0010 110s 16 2 0000 0000 0001 0101 s 
0 35 0000 0000 0010 101 s 27 1 0000 0000 0001 1111s 
0 36 0000 0000 0010 100 s 28 1 0000 0000 0001 1110s 
0 37 0000 0000 0010 011 s 29 1 0000 0000 0001 1101 s 
0 38 0000 0000 0010 010 s 30 1 0000 0000 0001 1100 s 
0 39 0000 0000 0010 001 s 31 1 0000 0000 0001 1011s 


Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 12.18c. MPEG 1 Variable-Length Code Table for dct_coefficient_first and 
dct_coefficient_next. 
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Run Level Fixed Length 
Code 
0) 0000 00 
1 0000 01 
2 0000 10 
63 1111 11 
-256 forbidden 
-255 1000 0000 0000 0001 
-254 1000 0000 0000 0010 
-129 1000 0000 0111 1111 
-128 1000 0000 1000 0000 
-127 1000 0001 
-126 1000 0010 
-2 1111 1110 
-1 1111 1111 
(0) forbidden 
1 0000 0001 
127 0111 1111 
128 0000 0000 1000 0000 
129 0000 0000 1000 0001 
255 0000 0000 1111 1111 


Table 12.19. Run, Level Encoding Following An Escape Code for 
dct_coefficient_first and dct_coefficient_next. 
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System Bitstream 


The system bitstream multiplexes the audio 
and video bitstreams into a single bitstream, 
and formats it with control information into a 
specific protocol as defined by MPEG 1. 

Packet data may contain either audio or 
video information. Up to 32 audio and 16 video 
streams may be multiplexed together. Two 
types of private data streams are also sup- 
ported. One type is completely private; the 
other is used to support synchronization and 
buffer management. 

Maximum packet sizes usually are about 
2,048 bytes, although much larger sizes are 
supported. When stored on CDROM, the 
length of the packs coincides with the sectors. 
Typically, there is one audio packet for every 
six or seven video packets. 

Figure 12.6 illustrates the system bit- 
stream, a hierarchical structure with three lay- 
ers. From top to bottom the layers are: 


ISO/IEC 11172 Layer 
Pack 
Packet 


ISO/IEC 11172 Layer 


ISO_11172_end_code 
This 32-bit field has a value of 000001B9y and 
terminates a system bitstream. 


Pack Layer 


Data for each pack consists of a pack header 
followed by a system header (optional) and 
packet data. The structure is shown in Figure 
12.6. 


Pack_start_code 
This 32-bit field has a value of 000001BAy and 
identifies the start of a pack. 


Fixed_bits 
These 4 bits always have a value of “0010.” 


System_clock_reference_32-30 
The system_clock_reference (SCR) is a 33-bit 
number coded using three fields separated by 
marker bits. 

system_clock_reference indicates the 
intended time of arrival of the last byte of the 
system_clock_reference field at the input of the 
decoder. The value of system_clock_reference is 
the number of 90 kHz clock periods. 


Marker_bit 
This bit always has a value of “1.” 


System_clock_reference_29-15 


Marker_bit 
This bit always has a value of “1.” 


System_clock_reference_14-0 


Marker_bit 
This bit always has a value of “1.” 


Marker_bit 
This bit always has a value of “1.” 


Mux_rate 

This 22-bit binary number specifies the rate at 
which the decoder receives the bitstream. It 
specifies units of 50 bytes per second, rounded 
upwards. A value of zero is not allowed. 
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Figure 12.6. MPEG 1 System Bitstream Layer Structures. Marker and reserved bits not shown. 


Marker_bit 
This bit always has a value of “1.” 


System Header 


System_header_start_code 
This 32-bit field has a value of 000001BBy and 
identifies the start of a system header. 


Header_length 

This 16-bit binary number specifies the num- 
ber of bytes in the system header following 
header_length. 


Marker_bit 
This bit always has a value of “1.” 


Rate_bound 

This 22-bit binary number specifies an integer 
value greater than or equal to the maximum 
value of mux_rate. It may be used by the 


decoder to determine if it is capable of decod- 
ing the entire bitstream. 


Marker_bit 
This bit always has a value of “1.” 


Audio_bound 

This 6-bit binary number, with a range of 0-32, 
specifies an integer value greater than or equal 
to the maximum number of simultaneously 
active audio streams. 


Fixed_flag 
This bit specifies fixed bit rate (“1”) or variable 
bit rate (“0”) operation. 


CSPS_flag 

This bit specifies whether the bitstream is a 
constrained system parameter stream (“1”) or 
not (“0”). 


552 = Chapter 12: MPEG 1 


System_audio_lock_flag 

This bit has a value of “1” if there is a constant 
relationship between the audio sampling rate 
and the decoder’s system clock frequency. 


System_video_lock_flag 

This bit has a value of “1” if there is a constant 
relationship between the video picture rate and 
the decoder’s system clock frequency. 


Marker_bit 
This bit always has a value of “1.” 


Video_bound 

This 5-bit binary number, with a range of 0-16, 
specifies an integer value greater than or equal 
to the maximum number of simultaneously 
active video streams. 


Reser ved_byte 
These eight bits always have a value of “1111 
1111.” 


Stream_ID 

This optional 8-bit field, as defined in Table 
12.20, indicates the type and stream number to 
which the following STD_buffer_bound_scale 
and STD_buffer_size_bound fields refer to. 
Each audio and video stream present in the 
system bitstream must be specified only once 
in each system header. 


Fixed_bits 
This optional 2-bit field has a value of “11.” Itis 
present only if stream_ID is present. 


Stream Type lag 
all audio streams 1011 1000 
all video streams 1011 1001 
reserved stream 1011 1100 
private stream 1 1011 1101 
padding stream 1011 1110 
private stream 2 1011 1111 
audio stream number xxxxx 110x xxxx 
video stream number xxxx 1110 xxxx 
reserved data stream number xxxx__1111 xxxx 


Table 12.20. MPEG 1 stream_ID Code. 


STD_buffer_bound_scale 

This optional 1-bit field specifies the scaling 
factor used to interpret 
STD_buffer_size_bound. For an audio stream, it 
has a value of “0.” For a video stream, it has a 
value of “1.” For other stream types, it can be 
either a “O” or a “1.” It is present only if 
stream_ID is present. 


STD_buffer_size_bound 

This optional 13-bit binary number specifies a 
value greater than or equal to the maximum 
decoder input buffer size. If 
STD_buffer_bound_scale = “0,” then 
STD_buffer_size_bound measures the size in 
units of 128 bytes. If STD_buffer_bound_scale = 
“1,” then STD_buffer_size_bound measures the 
size in units of 1024 bytes. It is present only if 
stream_ID is present. 
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Packet_start_code_prefix 

This 24-bit field has a value of 000001y. 
Together with the stream_ID that follows, it 
indicates the start of a packet. 


Stream_ID 
This 8-bit binary number specifies the type and 
number of the bitstream present, as defined in 
Table 12.20. 


Packet_length 

This 16-bit binary number specifies the num- 
ber of bytes in the packet after the 
packet_length field. 


Stuffing _byte 
This optional parameter has a value of “1111 
1111.” Up to 16 consecutive stuffing_bytes may 
used to meet the requirements of the storage 
medium. It is present only if stream_ID # pri- 
vate stream 2. 


STD_bits 

These optional two bits have a value of “01” 
and indicate the STVD_buffer_scale and 
STD_buffer_size fields follow. This field is 
present only if stream_ID # private stream 2. 


STD_buffer_scale 

This optional 1-bit field specifies the scaling 
factor used to interpret STD_buffer_size. For an 
audio stream, it has a value of “0.” For a video 
stream, it has a value of “1.” For other stream 
types, it can be either a “0” or a “1.” This field is 
present only if STD_bits is present and 
stream_ID private stream 2. 


STD_buffer_size 

This optional 13-bit binary number specifies 
the size of the decoder input buffer. If 
STD_buffer_scale = “0,” then STD_buffer_size 
measures the size in units of 128 bytes. If 
STD_buffer_scale = “1,” then STD_buffer_size 
measures the size in units of 1024 bytes. This 
field is present only if STD_bits is present and 
stream_ID # private stream 2. 


PTS_bits 

These optional four bits have a value of “0010” 
and indicate the following presentation time 
stamps are present. This field is present only if 
stream_ID ¥ private stream 2. 


Presentation_time_stamp_32-30 

The optional presentation_time_stamp (PTS) is 
a 33-bit number coded using three fields, sepa- 
rated by marker bits. PTS indicates the 
intended time of display by the decoder. The 
value of PTS is the number of periods of a 90 
kHz system clock. This field is present only if 
PTS_bits is present and stream_ID # private 
stream 2. 


Marker_bit 

This optional bit always has a value of “1.” It is 
present only if PTS_bits is present and 
stream_ID # private stream 2. 


Presentation_time_stamp_29-15 
This optional field is present only if PTS_bits is 
present and stream_ID # private stream 2. 


Marker_bit 

This optional bit always has a value of “1.” It is 
present only if PTS_bits is present and 
stream_ID # private stream 2. 
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Presentation_time_stamp_14-0 
This optional field is present only if PT'S_bits is 
present and stream_ID # private stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” It is present only if PT'S_bits is present and 
stream_ID # private stream 2. 


DTS_bits 

These optional four bits have a value of “0011” 
and indicate the following presentation and 
decoding time stamps are present. This field is 
present only if stream_ID # private stream 2. 


Presentation_time_stamp_32-30 

The optional presentation_time_stamp (PTS) is 
a 33-bit number coded using three fields, sepa- 
rated by marker bits. PTS indicates the 
intended time of display by the decoder. The 
value of PTS is the number of periods of a 90 
kHz system clock. This field is present only if 
DTS_bits is present and stream_ID # private 
stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” Itis present only if DTS_bits is present and 
stream_ID # private stream 2. 


Presentation_time_stamp_29-15 
This optional field is present only if DTS_bits is 
present and stream_ID # private stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” Itis present only if DTS_bits is present and 
stream_ID # private stream 2. 


Presentation_time_stamp_ 14-0 
This optional field is present only if DTS_bits is 
present and stream_ID # private stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” It is present only if DTS_bits is present and 
stream_ID # private stream 2. 


Fixed_bits 

This optional 4-bit field has a value of “0001.” It 
is present only if DTS_bits is present and 
stream_ID # private stream 2. 


Decoding_time_stamp_32-30 

The optional decoding_time_stamp (DTS) is a 
33-bit number coded using three fields, sepa- 
rated by marker bits. DTS indicates the 
intended time of decoding by the decoder of 
the first access unit that commences in the 
packet. The value of DTS is the number of peri- 
ods of a 90 kHz system clock. It is present only 
if DTS_bits is present and stream_ID # private 
stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” Itis present only if DTS_bits is present and 
stream_ID # private stream 2. 


Decoding_time_stamp_29-15 
This optional field is present only if DTS_bits is 
present and stream_ID # private stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” It is present only if DTS_bits is present and 
stream_ID # private stream 2. 
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Decoding_time_stamp_14—0 
This optional field is present only if DTS_bits is 
present and stream_ID # private stream 2. 


Marker_bit 

This optional 1-bit field always has a value of 
“1.” Itis present only if DTS_bits is present and 
stream_ID # private stream 2. 


NonPTS_nonDTS_bits 

These optional eight bits have a value of “0000 
1111” and are present if the PTS_bits field (and 
the corresponding fields) or the DTS_bits field 
(and the corresponding fields) are not present. 


Packet_data_byte 

This is [n] bytes of data from the bitstream 
specified by the packet layer stream_ID. The 
number of data bytes may be determined from 
the packet_length parameter. 


Video Decoding 


A system demultiplexer parses the system bit- 
stream, demultiplexing the audio and video bit- 
streams. 

The video decoder essentially performs 
the inverse of the encoder. From the coded 
video bitstream, it reconstructs the I frames. 
Using I frames, additional coded data, and 
motion vectors, the P and B frames are gener- 
ated. Finally, the frames are output in the 
proper order. 


Fast Playback Considerations 


Fast for ward operation can be implemented by 
using D frames or the decoding only of I 
frames. However, decoding only I frames at the 
faster rate places a major burden on the trans- 
mission medium and the decoder. 

Alternately, the source may be able to sort 
out the desired I frames and transmit just 
those frames, allowing the bit rate to remain 
constant. 


Pause Mode Considerations 


This requires the decoder to be able to control 
the incoming bitstream. If it doesn’t, when 
playback resumes there may be a delay and 
skipped frames. 


Reverse Playback Considerations 


This requires the decoder to be able to decode 
each group of pictures in the forward direc- 
tion, store them, and display them in reverse 
order. To minimize the storage requirements 
of the decoder, groups of pictures should be 
small or the frames may be reordered. Reor- 
dering can be done by transmitting frames in 
another order or by reordering the coded pic- 
tures in the decoder buffer. 


Decode Postprocessing 


The SIF data usually is converted to 720 x 480 
(NTSC) or 720 x 576 (PAL) interlaced resolu- 
tion. Suggested upsampling filters are dis- 
cussed in the MPEG 1 specification. The 
original decoded lines correspond to field 1. 
Field 2 uses interpolated lines. 
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Real-World Issues 


System Bitstream Termination 


A common error is the improper placement of 
sequence_end_code in the system bitstream. 
When this happens, some decoders may not 
know that the end of the video occurred, and 
output garbage. 

Another problem occurs when a system bit- 
stream is shortened just by eliminating trailing 
frames, removing sequence_end_code altogether. 
In this case, the decoder may be unsure when 
to stop. 


Timecodes 


Since some decoders rely on the timecode 
information, it should be implemented. To min- 
imize problems, the video bitstream should 
start with a timecode of zero and increment by 
one each frame. 


Variable Bit Rates 


Although variable bit rates are supported, a 
constant bit rate should be used if possible. 
Since vbv_delay doesn’t make sense for a vari- 
able bit rate, the MPEG 1 standard specifies 
that it be set to the maximum value. 

However, some decoders use vbv_delay 
with variable bit rates. This could result in a 2- 
3 second delay before starting video, causing 
the first 60-90 frames to be skipped. 


Constrained Bitstreams 


Most MPEG 1 decoders can handle only the 
constrained parameters subset of MPEG 1. To 
ensure maximum compatibility, only the con- 
strained parameters subset should be used. 


Source Sample Clock 


Good compression with few artifacts requires 
a video source that generates or uses a very 
stable sample clock. This ensures that the ver- 
tical alignment of samples over the entire pic- 
ture is maintained. With poorly designed 
sample clock generation, the artifacts usually 
get worse towards the right side of the picture. 


References 


1. Digital Video Magazine, “Not All MPEGs 
Are Created Equal,” by John Toebes, Doug 
Walker, and Paul Kaiser, August 1995. 

2. Digital Video Magazine, “Squeeze the Most 
From MPEG,” by Mark Magel, August 
1995. 

3. ISO/IEC 11172-1, Coding of moving pic- 
tures and associated audio for digital stor- 
age media at up to about 1.5 Mbit/s, Part 
1: Systems. 

4. ISO/IEC 11172-2, Coding of moving pic- 
tures and associated audio for digital stor- 
age media at up to about 1.5 Mbit/s, Part 
2: Video. 

5. ISO/IEC 11172-3, Coding of moving pic- 
tures and associated audio for digital stor- 
age media at up to about 1.5 Mbit/s, Part 
3: Audio. 


ee 
MPEG 2 


MPEG 2 extends MPEG 1 to cover a wider 
range of applications. The MPEG 1 chapter 
should be reviewed to become familiar with 
the basics of MPEG before reading this chap- 
ter. 

The primary application targeted during 
the definition process was all-digital transmis- 
sion of broadcast-quality video at bit rates of 4— 
9 Mbps. However, MPEG 2 is useful for many 
other applications, such as HDTV, and now 
supports bit rates of 1.5-60 Mbps. 

MPEG 2 is an ISO standard (ISO/IEC 
13818), and consists of nine parts: 


systems ISO/IEC 13818-1 
video ISO/IEC 13818-2 
audio ISO/IEC 13818-3 


conformance testing ISO/IEC 13818-4 
software simulation ISO/IEC 13818-5 
DSM-CC extensions ISO/IEC 13818-6 
advanced audio coding ISO/IEC 13818-7 
RTI extension ISO/IEC 13818-9 


DSM-CC conformance ISO/IEC 13818-10 


As with MPEG 1, the compressed bit- 
streams implicitly define the decompression 
algorithms. The compression algorithms are 
up to the individual manufacturers, within the 
scope of an international standard. 


In addition to audio/video multiplexing, a 
major feature of the Systems specification 
(ISO/IEC 13818-1) is the more accurate syn- 
chronization of audio, video, and data. Addi- 
tional functions include random access, 
identification of information carried within the 
stream, procedures to support user access 
control, and error protection mechanisms. 

The Digital Storage Media Command and 
Control (DSM-CC) extension (ISO/IEC 13818- 
6) facilitates the managing of MPEG bit- 
streams and equipment by providing a com- 
mon command and _ control interface. 
Capabilities include synchronized download 
services, opportunistic data services, resource 
announcements in broadcast and interactive 
services, and data broadcasting. 

The audio extension (ISO/IEC 13818-7) 
adds non-backwards-compatible audio modes, 
such as the Dolby Digital 5.1 standard. 

The Real Time Interface (RTI) extension 
(ISO/IEC 13818-9) defines a common inter- 
face point to which terminal equipment manu- 
facturers and network operators can design. 
RTI will specify a delivery model for the bytes 
of an MPEG-2 System stream at the input of a 
real decoder, whereas MPEG-2 System defines 
an idealized byte delivery schedule. 
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Audio Overview 


In addition to the non-backwards-compatible 
audio extension (ISO/IEC 13818-7), MPEG 2 
supports up to five full-bandwidth channels 
compatible with MPEG 1 audio coding. It also 
extends the coding of MPEG 1 audio to half 
sampling rates (16 kHz, 22.05 kHz, and 24 
kHz) for improved quality for bit rates at or 
below 64 kbps per channel. 

MPEG 2.5 is an unofficial, yet common, 
extension to the audio capabilities of MPEG 2. 
It adds sampling rates of 8 kHz, 11.025 kHz, 
and 12 kHz. 


Video Overview 


With MPEG 2, profiles specify the syntax (.e., 
algorithms) and levels specify various parame- 
ters (resolution, frame rate, bit rate, etc.). Main 
Profile@Main Level is targeted for SDTV appli- 
cations, while Main Profile@High Level is tar- 
geted for HDTV applications. 


Levels 


MPEG 2 supports four levels, which specify 
resolution, frame rate, coded bit rate, and so on 
for a given profile. 


Low Level (LL) 

MPEG 1 Constrained Parameters Bitstream 
(CPB) supports up to 352 x 288 at up to 30 
frames per second. Maximum bit rate is 4 
Mbps. 


Main Level (ML) 

MPEG 2 Constrained Parameters Bitstream 
(CPB) supports up to 720 x 576 at up to 30 
frames per second and is intended for SDTV 
applications. Maximum bit rate is 15-20 Mbps. 


High 1440 Level 

This Level supports up to 1440 x 1088 at up to 
60 frames per second and is intended for 
HDTV applications. Maximum bit rate is 60-80 
Mbps. 


High Level (HL) 

High Level supports up to 1920 x 1088 at up to 
60 frames per second and is intended for 
HDTV applications. Maximum bit rate is 80- 
100 Mbps. 


Profiles 


MPEG 2 supports six profiles, which specify 
which coding syntax (algorithms) is used. 
Tables 13.1 through 13.8 illustrate the various 
combinations of levels and profiles allowed. 


Simple Profile (SP) 

Main profile without the B frames, intended for 
software applications and perhaps digital cable 
TV. 


Main Profile (MP) 

Supported by most MPEG 2 decoder chips, it 
should satisfy 90% of the SDTV applications. 
Typical resolutions are shown in Table 13.6. 


Multiview Profile (MVP) 

By using existing MPEG 2 tools, it is possible 
to encode video from two cameras shooting 
the same scene with a small angle between 
them. 


4:2:2 Profile (422P) 

Previously known as “studio profile,” this pro- 
file uses 4:2:2 YCbCr instead of 4:2:0, and with 
main level, increases the maximum bit rate up 
to 50 Mbps (300 Mbps with high level). It was 
added to support pro-video SDTV and HDTV 
requirements. 
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Profile 
Level Nonscalable Scalable 

Simple Main Multiview 4:2:2 SNR Spatial High 
High - yes - yes - - yes 
High 1440 - yes - - - yes yes 
Main yes yes yes yes yes - yes 
Low - yes - - yes - - 

Table 13.1. MPEG 2 Acceptable Combinations of Levels and Profiles. 
Profile 
Constraint Nonscalable Scalable 

Simple Main Multiview 4:2:2 SNR Spatial High 
chroma format 4:2:0 4:2:0 4:2:0 4:2:2 4:2:0 4:2:0 ae 
picture types I,P I,PB IPB I,PB IPB I,PB IPB 

SNR or SNR or 

scalable modes - - - - SNR Spatial Spatial 
purates “2h 8,9, 10 8,9, 10 8,9, 10 8,9, 10 8,9, 10 8,9,10 | 8,9, 10,11 
precision (bits) 
sequence scalable m a 4 
Fe ION 0 0 no 0 yes yes yes 
picture spatial 
scalable no no no no no yes yes 
extension 
repeat first constrained unconstrained 


field 


Table 13.2. Some MPEG 2 Profile Constraints. 
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Level 


Maximum Number of 
Layers 


Profile 


SNR Spatial 


High 


High 


All layers (base + enhancement) 
Spatial enhancement layers 
SNR enhancement layers 


High 
1440 


All layers (base + enhancement) 
Spatial enhancement layers 
SNR enhancement layers 


! 
= 


Main 


All layers (base + enhancement) 
Spatial enhancement layers 
SNR enhancement layers 


RPRwWwlReRwl]Re Rw 


Low 


All layers (base + enhancement) 
Spatial enhancement layers 
SNR enhancement layers 


rFPOoON|rROMm 


Table 13.3. MPEG 2 Number of Permissible Layers for Scalable Profiles. 


Profile Profile at Level 
erafile Base Enhancement Enhancement oe 
Layer Layer 1 Layer 2 
SNR 4:2:0 SNR, 4:2:0 - MP at same level 
4:2:0 SNR, 4:2:0 - MP at same level 
. 4:2:0 Spatial, 4:2:0 - 
Spatial : 
4:2:0 SNR, 4:2:0 Spatial, 4:2:0 MP at (level-1) 
4:2:0 Spatial, 4:2:0 SNR, 4:2:0 
4:2:0 or 4:2:2 - - 
4:2:0 SNR, 4:2:0 - HP at same level 
4:2:0 or 4:2:2 SNR, 4:2:2 - 
4:2:0 Spatial, 4:2:0 - 
: 4:2:0 or 4:2:2 Spatial, 4:2:2 - 
High F 
4:2:0 SNR, 4:2:0 Spatial, 4:2:0 or 4:2:2 
4:2:0 or 4:2:2 SNR, 4:2:2 Spatial, 4:2:2 HP at (level-1) 
4:2:0 Spatial, 4:2:0 SNR, 4:2:0 or 4:2:2 
4:2:0 Spatial, 4:2:2 SNR, 4:2:2 
4:2:2 Spatial, 4:2:2 SNR, 4:2:2 


Table 13.4. Some MPEG 2 Video Decoder Requirements for Various Profiles. 
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Spatial Profile 
Level Resolution Parameter SNR 
Layer Simple Main Multiview | 4:2:2 Spatial High 
Samples per line 1920 1920 1920 
Enhancement Lines per frame - 1088 - 1088 - 1088 
: Frames per second 60 60 60 
High : 
Samples per line 960 
Lower Lines per frame - - - - - 576 
Frames per second 30 
Samples per line 1440 1440 1440 
Enhancement Lines per frame - 1088 - - 1088 1088 
High Frames per second 60 60 60 
1440 Samples per line 720 720 
Lower Lines per frame - - - - 576 576 
Frames per second 30 30 
Samples per line 720 720 720 720 720 720 
Enhancement Lines per frame 576 576 576 608 576 576 
. Frames per second 30 30 30 30 30 30 
Main ; 
Samples per line 352 
Lower Lines per frame - - - - - 288 
Frames per second 30 
Samples per line 352 352 
Enhancement Lines per frame - 288 - - 288 - 
Frames per second 30 30 
Low 7 
Samples per line 
Lower Lines per frame - - - - - - 
Frames per second 
Notes: 


1. The above levels and profiles that originally specified 1152 maximum lines per frame were 
changed to 1088 lines per frame. 


Table 13.5. MPEG 2 Upper Limits of Resolution and Temporal Parameters. In the case of 
single layer or SNR scalability coding, the “Enhancement Layer” parameters apply. 
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Level 


Maximum 
Bit Rate 
(Mbps) 


Typical Active 
Resolutions 


Refresh Rate 
(Hz) 


Typical Active 
Resolutions 


Refresh Rate? 
(Hz) 


High 


80 
(800 for 4:2:2 
Profile) 


1920 x 1080! 


23.976p 


24p 


2050p 


29.97p 


30p 


50i 


59.941 


60i 


High 1440 


60 


1280 x 720 


23.976p 


24p 


2050p 


29.97p 


30p 


S0p 


59.94p 


60p 


1440 x 1080! 


50i 


59.94: 


60i 


Main 


15 
(50 for 4:2:2 
Profile) 


352 x 480 


29.97p 


352 x 576 


25p 


544 x 480 


29.97p 


544 x 576 


25p 


640 x 480 


29.97p 


704 x 480 


29.97p 


704 x 576 


25p 


720 x 480 


29.97p 


720 x 576 


25p 


Low 


320 x 240 


29.97p 


352 x 240 


29.97p 


352 x 288 


25p 


Notes: 


1. The video coding system requires that the number of active scan lines be a multiple of 32 for 
interlaced pictures, and a multiple of 16 for progressive pictures. Thus, for the 1080-line inter- 
laced format, the video encoder and decoder must actually use 1088 lines. The extra eight 
lines are “dummy” lines having no content, and designers choose dummy data that simplifies 
the implementation. The extra eight lines are always the last eight lines of the encoded image. 
These dummy lines do not carry useful information, but add little to the data required for 
transmission. 

2. p= progressive; i = interlaced. 


Table 13.6. Example Levels and Resolutions for MPEG 2 Main Profile. 
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Spatial Profile 
Level Resolution 
Layer Simple Main SNR/Spatial High 
62.668800 (4:2:2) 
sigs Enhancement - 62.668800 - 83.558400 (4:2:0) 
i 
Lower _ _ es 14.745600 (4:2:2) 
19.660800 (4:2:0) 
47.001600 (4:2:2) 
High Enhancement - 47.001600 47.001600 62.668800 (4:2:0) 
1440 11.059200 (4:2:2) 
Lower - - 10.368000 14.745600 (4:2:0) 
11.059200 (4:2:2) 
Main Enhancement 10.368000 10.368000 10.368000 14.745600 (4:2:0) 
Lower - - - 3.041280 (4:2:0) 
Enhancement - 3.041280 3.041280 - 
Low 
Lower - - - - 


Table 13.7. MPEG 2 Upper Limits for Y Sample Rate (Msamples/second). In the case of 
single layer or SNR scalability coding, the “Enhancement Layer” parameters apply. 


Profile 
Level Nonscalable Scalable 
Simple Main Multiview 4:2:2 SNR/Spatial High 
100 (all layers) 
High - 80 - 300 - 80 (middle + base layers) 
25 (base layer) 
60 (all layers) 80 (all layers) 
High 1440 - 60 - - 40 (middle + base layers) 60 (middle + base layers) 
15 (base layer) 20 (base layer) 
20 (all layers) 
Main 15 15 15 50 15 (both layers) iis Guiddle + base layers) 
10 (base layer) 
4 (base layer) 
4 (both layers) 
Low = id a a 3 (base layer) = 
Table 13.8. MPEG 2 Upper Limits for Bit Rates (Mbps). 
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SNR and Spatial Profiles 
Adds support for SNR scalability and/or spa- 
tial scalability. 


High Profile (HP) 

Supported by MPEG 2 decoder chips targeted 
for HDTV applications. Typical resolutions are 
shown in Table 13.6. 


Scalability 


The MPEG 2 SNR, Spatial, and High profiles 
support four scalable modes of operation. 
These modes break MPEG 2 video into layers 
for the purpose of prioritizing video data. Scal- 
ability is not commonly used since efficiency 
decreases by about 2 dB (or about 30% more 
bits are required). 


SNR Scalability 

This mode is targeted for applications that 
desire multiple quality levels. All layers have 
the same spatial resolution. The base layer pro- 
vides the basic video quality. The enhance- 
ment layer increases the video quality by 
providing refinement data for the DCT coeffi- 
cients of the base layer. 


Spatial Scalability 

Useful for simulcasting, each layer has a differ- 
ent spatial resolution. The base layer provides 
the basic spatial resolution and temporal rate. 
The enhancement layer uses the spatially inter- 
polated base layer to increase the spatial reso- 
lution. For example, the base layer may 
implement 352 x 240 resolution video, with the 
enhancement layers used to generate 704 x 
480 resolution video. 


Temporal Scalability 
This mode allows migration from low temporal 
rate to higher temporal rate systems. The base 


layer provides the basic temporal rate. The 
enhancement layer uses temporal prediction 
relative to the base layer. The base and 
enhancement layers can be combined to pro- 
duce a full temporal rate output. All layers have 
the same spatial resolution and chroma for- 
mats. In case of errors in the enhancement lay- 
ers, the base layer can be used for 
concealment. 


Data Partitioning 

This mode is targeted for cell loss resilience in 
ATM networks. It breaks the 64 quantized 
transform coefficients into two bitstreams. The 
higher priority bitstream contains critical 
lower-frequency DCT coefficients and side 
information such as headers and motion vec- 
tors. A lower-priority bitstream carries higher- 
frequency DCT coefficients that add detail. 


Transport and Program Streams 


The MPEG 2 Systems Standard specifies two 
methods for multiplexing the audio, video, and 
other data into a format suitable for transmis- 
sion and storage. 

The Program Stream is designed for appli- 
cations where errors are unlikely. It contains 
audio, video, and data bitstreams (also called 
elementary bitstreams) all merged into a single 
bitstream. The program stream, as well as 
each of the elementary bitstreams, may be a 
fixed or variable bit rate. DVDs use program 
streams, carrying the DVD-specific data in pri- 
vate data streams interleaved with the various 
video and audio streams. 

The Transport Stream, using fixed-size 
packets of 188 bytes, is designed for applica- 
tions where data loss is likely. Also containing 
audio, video, and data bitstreams all merged 
into a single bitstream, multiple programs can 
be carried. The DVB and ATSC digital televi- 
sion standards use transport streams. 
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Both the Transport Stream and Program 
Stream are based on a common packet struc- 
ture, facilitating common decoder implementa- 
tions and conversions. Both streams are 
designed to support a large number of known 
and anticipated applications, while retaining 
flexibility. 


Video Encoding 


YCbCr Color Space 


MPEG 2 uses the YCbCr color space, support- 
ing 4:2:0, 4:2:2, and 4:4:4 sampling. The 4:2:2 
and 4:4:4 sampling options increase the 
chroma resolution over 4:2:0, resulting in bet- 
ter picture quality. 

The 4:2:0 sampling structure for MPEG 2 
is shown in Figures 3.8 through 3.10. The 4:2:2 
and 4:4:4 sampling structures are shown in 
Figures 3.2 and 3.3. 


Coded Picture Types 


There are three types of coded pictures. I 
(intra) pictures are fields or frames coded as a 
stand-alone still image. They allow random 
access points within the video stream. As such, 
I pictures should occur about two times a sec- 
ond. I pictures also should be used where 
scene cuts occur. 

P (predicted) pictures are fields or frames 
coded relative to the nearest previous I or P 
picture, resulting in forward prediction pro- 
cessing, as shown in Figure 13.1. P pictures 
provide more compression than I pictures, 
through the use of motion compensation, and 
are also a reference for B pictures and future P 
pictures. 


B (bidirectional) pictures are fields or 
frames that use the closest past and future I or 
P picture as a reference, resulting in bidirec- 
tional prediction, as shown in Figure 13.1. B 
pictures provide the most compression. and 
decrease noise by averaging two pictures. Typ- 
ically, there are two B pictures separating I or 
P pictures. 

D (DC) pictures are not supported in 
MPEG 2, except for decoding to support back- 
wards compatibility with MPEG 1. 

A group of pictures (GOP) is a series of 
one or more coded pictures intended to assist 
in random accessing and editing. The GOP 
value is configurable during the encoding pro- 
cess. The smaller the GOP value, the better 
the response to movement (since the I pictures 
are closer together), but the lower the com- 
pression. 

In the coded bitstream, a GOP must start 
with an I picture and may be followed by any 
number of IJ, P or B pictures in any order. In 
display order, a GOP must start with an I or B 
picture and end with an I or P picture. Thus, 
the smallest GOP size is a single I picture, with 
the largest size unlimited. 

Each GOP should be coded independently 
of any other GOP. However, this is not true 
unless no B pictures precede the first I picture, 
or if they do, they use only backward motion 
compensation. This results in both open and 
closed GOP formats. A closed GOP is a GOP 
that can be decoded without using pictures of 
the previous GOP for motion compensation. 
An open GOP identified by the broken_link flag, 
indicates that the first B pictures (if any) imme- 
diately following the first I picture after the 
GOP header may not be decoded correctly 
(and thus not be displayed) since the reference 
picture used for prediction is not available due 
to editing. 


566 Chapter 13: MPEG 2 


PICTURE DISPLAY 
ORDER 


FORWARD 
PREDICTION 


PICTURE TRANSMIT 
ORDER 3 


[| INTRA (1) PICTURE 


a PREDICTED (P) PICTURE 


2 6 ¢ 5 


BI-DIRECTIONAL 
PREDICTION 


[| BI-DIRECTIONAL (B) PICTURE 


Figure 13.1. MPEG 2 I, P, and B Pictures. Some pictures can be transmitted out of sequence, 
complicating the interpolation process and requiring picture reordering by the MPEG decoder. 


Arrows show inter-frame dependencies. 


Motion Compensation 


Motion compensation for MPEG 2 is more 
complex due to the introduction of fields. After 
a macroblock has been compressed using 
motion compensation, it contains both the spa- 
tial difference (motion vectors) and content 
difference (error terms) between the refer- 
ence macroblock and macroblock being 
coded. 

The two major classifications of prediction 
are field and frame. Within field pictures, only 
field predictions are used. Within frame pic- 
tures, either field or frame predictions can be 
used (selectable at the macroblock level). 

Motion vectors for MPEG 2 always are 
coded in half-pixel units. MPEG 1 supports 
either half-pixel or full-pixel units. 


16 x 8 Motion Compensation Option 

Two motion vectors (four for B pictures) per 
macroblock are used, one for the upper 16 x 8 
region of a macroblock and one for the lower 
16 x 8 region of a macroblock. It is only used 
with field pictures. 


Dual-Prime Motion Compensation Option 
This is only used with P pictures that have no 
B pictures between the predicted and refer- 
ence fields of frames. One motion vector is 
used, together with a small differential motion 
vector. All of the necessary predictions are 
derived from these. 
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Macroblocks 


Three types of macroblocks are available in 
MPEG 2. 

The 4:2:0 macroblock (Figure 13.2) con- 
sists of four Y blocks, one Cb block, and one Cr 
block. The block ordering is shown in the fig- 
ure. 

The 4:2:2 macroblock (Figure 13.3) con- 
sists of four Y blocks, two Cb blocks, and two 
Cr blocks. The block ordering is shown in the 
figure. 

The 4:4:4 macroblock (Figure 13.4) con- 
sists of four Y blocks, four Cb blocks, and four 
Cr blocks. The block ordering is shown in the 
figure. 

Macroblocks in P pictures are coded using 
the closest previous I or P picture as a refer- 
ence, resulting in two possible codings: 


e intra coding 
no motion compensation 


e forward prediction 
closest previous I or P picture is the 
reference 


Macroblocks in B pictures are coded using 
the closest previous and/or future I or P pic- 
ture as a reference, resulting in four possible 
codings: 


¢ intra coding 
no motion compensation 


e forward prediction 
closest previous I or P picture is the 
reference 


e backward prediction 
closest future I or P picture is the 
reference 


e bi-directional prediction 
two pictures used as the reference: 
the closest previous I or P picture and 
the closest future I or P picture 


| Pictures 


Macroblocks 
There are ten types of macroblocks in I pic- 
tures, as shown in Table 13.25. 

If the [macroblock quant] column in Table 
13.25 has a “1,” the quantizer scale is transmit- 
ted. For the remaining macroblock types, the 
DCT correction is coded using the previous 
value for quantizer scale. 

If the [coded pattern] column in Table 
13.25 has a “1,” the 6-bit coded block pattern is 
transmitted as a variable-length code. This tells 
the decoder which of the six blocks in the 4:2:0 
macroblock are coded (“1”) and which are not 
coded (“0”). Table 13.30 lists the codewords 
assigned to the 63 possible combinations. 
There is no code for when none of the blocks 
are coded; it is indicated by the macroblock 
type. For 4:2:2 and 4:4:4 macroblocks, an addi- 
tional 2 or 6 bits, respectively, are used to ex- 
tend coded block pattern. 


DCT 

Each 8 x 8 block (of input samples or predic- 
tion error terms) is processed by an 8 x 8 DCT 
(discrete cosine transform), resulting in an 8 x 
8 block of horizontal and vertical frequency 
coefficients, as shown in Figure 7.47. 

Input sample values are 0-255, resulting in 
a range of 0-2,040 for the DC coefficient and a 
range of about —2,048 to 2,047 for the AC coef- 
ficients. 

Due to spatial and SNR scalability, non- 
intra blocks (blocks within a non-intra macro- 
block) are also possible. Non-intra block coeffi- 
cients represent differences between sample 
values rather than actual sample values. They 
are obtained by subtracting the motion-com- 
pensated values from the previous picture 
from the values in the current macroblock. 
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Figure 13.2. 


Figure 13.3. 


Figure 13.4. 
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MPEG 2 4:2:0 Macroblock Structure. 
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MPEG 2 4:2:2 Macroblock Structure. 
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LEFT RIGHT 


MPEG 2 4:4:4 Macroblock Structure. 
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Quantizing 

The 8 x 8 block of frequency coefficients are 
uniformly quantized, limiting the number of 
allowed values. The quantizer step scale is 
derived from the quantization matrix and quan- 
tizer scale and may be different for different 
coefficients and may change between macro- 
blocks. 

Since the eye is sensitive to large luma 
areas, the quantizer step size of the DC coeffi- 
cient is selectable to 8, 9, 10, or 11 bits of preci- 
sion. The quantized DC coefficient is deter- 
mined by dividing the DC coefficient by 8, 4, 2, 
or 1 and rounding to the nearest integer. 

AC coefficients are quantized using two 
quantization matrices: one for intra macro- 
blocks and one for non-intra macroblocks. 
When using 4:2:2 or 4:4:4 data, different matri- 
ces may be used for Y and CbCr data. Each 
quantization matrix has a default set of values 
that may be overwritten. 

If the [macroblock quant] column in Table 
13.25 has a “1,” the quantizer scale is transmit- 
ted. For the remaining macroblock types, the 
DCT correction is coded using the previous 
value for quantizer scale. 


Zig-Zag Scan 

Zig-zag scanning, starting with the DC compo- 
nent, generates a linear stream of quantized 
frequency coefficients arranged in order of 
increasing frequency, as shown in Figures 7.50 
and 7.51. This produces long runs of zero coef- 
ficients. 


Coding of Quantized DC Coefficients 
After the DC coefficients have been quantized, 
they are losslessly coded. 

Coding of Y blocks within a macroblock fol- 
lows the order shown in Figures 13.2 through 
13.4. The DC value of block 4 is the DC predic- 
tor for block 1 of the next macroblock. At the 
beginning of each slice, whenever a macro- 


block is skipped, or whenever a non-intra mac- 
roblock is decoded, the DC predictor is set to 
128 (if 8 bits of DC precision), 256 (if 9 bits of 
DC precision), 512 (if 10 bits of DC precision), 
or 1,024 Gf 11 bits of DC precision). 

The DC values of each Cb and Cr block are 
coded using the DC value of the correspond- 
ing block of the previous macroblock as a pre- 
dictor. At the beginning of each. slice, 
whenever a macroblock is skipped, or when- 
ever a non-intra block is decoded, the DC pre- 
dictors are set to 128 (8 bits of DC precision), 
256 (9 bits of DC precision), 512 (10 bits of DC 
precision), or 1,024 (11 bits of DC precision). 

However, a common implementation is to 
reset the DC predictors to zero and center the 
intra-block DC terms about zero instead of the 
50% grey level. Decoders then only have to 
handle the different intra DC precisions in the 
quantizer (which already has a multiplier that 
can be used to reconstruct the right value) 
instead of the parser (which generally doesn’t 
touch that data and has no multiplier). 


Coding of Quantized AC Coefficients 

After the AC coefficients have been quantized, 
they are scanned in the zig-zag order shown in 
Figure 7.50 or 7.51 and coded using run-length 
and level. The scan starts in position 1, as 
shown in Figures 7.50 and 7.51, as the DC coef- 
ficient in position 0 is coded separately. 

The run-lengths and levels are coded as 
shown in Tables 13.34 and 13.35. The “s” bit 
denotes the sign of the level; “0” is positive and 
“1” is negative. For intra blocks, either Table 
13.34 or Table 13.35 may be used, as specified 
by intra_vlc_format in the bitstream. For non- 
intra blocks, only Table 13.34 is used. 

For run-level combinations not shown in 
Tables 13.34 and 13.35, an escape sequence is 
used, consisting of the escape code (ESC), fol- 
lowed by the run-length and level codes from 
Tables 13.36 and 13.37. 
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After the last DCT coefficient has been 
coded, an EOB code is added to tell the 
decoder that there are no more quantized coef- 
ficients in this 8 x 8 block. 


P Pictures 


Macroblocks 

There are 26 types of macroblocks in P pic- 
tures, as shown in Table 13.26, due to the addi- 
tional complexity of motion compensation. 

Skipped macroblocks are present when 
the macroblock_address_increment parameter 
in the bitstream is greater than 1. For P field 
pictures, the decoder predicts from the field of 
the same parity as the field being predicted, 
motion vector predictors are set to 0, and the 
motion vector is set to 0. For P frame pictures, 
the decoder sets the motion vector predictors 
to 0, and the motion vector is set to 0. 

If the [macroblock quant] column in Table 
13.26 has a “1,” the quantizer scale is transmit- 
ted. For the remaining macroblock types, the 
DCT correction is coded using the previous 
value for quantizer scale. 

If the [motion forward] column in Table 
13.26 has a “1,” horizontal and vertical forward 
motion vectors are successively transmitted. 

If the [coded pattern] column in Table 
13.26 has a “1,” the 6-bit coded block pattern is 
transmitted as a variable-length code. This tells 
the decoder which of the six blocks in the mac- 
roblock are coded (“1”) and which are not 
coded (“0”). Table 13.30 lists the codewords 
assigned to the 63 possible combinations. 
There is no code for when none of the blocks 
are coded; it is indicated by the macroblock 
type. For intra-coded macroblocks in P and B 
pictures, the coded block pattern is not trans- 
mitted, but is assumed to be a value of 63 (all 
blocks are coded). For 4:2:2 and 4:4:4 macro- 
blocks, an additional 2 or 6 bits, respectively, 
are used to extend coded block pattern. 


DCT 

Intra block AC coefficients are transformed in 
the same manner as they are for I pictures. 
Intra block DC coefficients are transformed 
differently; the predicted values are set to 
1,024, unless the previous block was intra- 
coded. 

Non-intra block coefficients represent dif- 
ferences between sample values rather than 
actual sample values. They are obtained by 
subtracting the motion compensated values of 
the previous picture from the values in the cur- 
rent macroblock. There is no prediction of the 
DC value. 

Input sample values are -255 to +255, 
resulting in a range of about —2,000 to +2,000 
for the AC coefficients. 


Quantizing 
Intra blocks are quantized in the same manner 
as they are for I pictures. 

Non-intra blocks are quantized using the 
quantizer scale and the non-intra quantization 
matrix. The AC and DC coefficients are quan- 
tized in the same manner. 


Coding of Intra Blocks 

Intra blocks are coded the same way as I pic- 
ture intra blocks. There is a difference in the 
handling of the DC coefficients in that the pre- 
dicted value is 128, unless the previous block 
was intra coded. 


Coding of Non-Intra Blocks 

The coded block pattern (CBP) is used to 
specify which blocks have coefficient data. 
These are coded similarly to the coding of intra 
blocks, except the DC coefficient is coded in 
the same manner as the AC coefficients. 
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B Pictures 


Macroblocks 
There are 34 types of macroblocks in B pic- 
tures, as shown in Table 13.27, due to the addi- 
tional complexity of backward motion 
compensation. 

For B field pictures, the decoder predicts 
from the field of the same parity as the field 
being predicted. The direction of prediction 
(forward, backward, or bidirectional) is the 
same as the previous macroblock, motion vec- 
tor predictors are unaffected, and the motion 
vectors are taken from the appropriate motion 
vector predictors. For B frame pictures, the 
direction of prediction (forward, backward, or 
bidirectional) is the same as the previous mac- 
roblock, motion vector predictors are unaf- 
fected, and the motion vectors are taken from 
the appropriate motion vector predictors. 

If the [macroblock quant] column in Table 
13.27 has a “1,” the quantizer scale is transmit- 
ted. For the rest of the macroblock types, the 
DCT correction is coded using the previous 
value for the quantizer scale. 

If the [motion forward] column in Table 
13.27 has a “1,” horizontal and vertical for ward 
motion vectors are successively transmitted. If 
the [motion backward] column in Table 13.27 
has a “1,” horizontal and vertical backward 
motion vectors are successively transmitted. If 
both forward and backward motion types are 
present, the vectors are transmitted in this 
order: 


horizontal forward 
vertical forward 
horizontal backward 
vertical backward 


If the [coded pattern] column in Table 
13.27 has a “1,” the 6-bit coded block pattern is 
transmitted as a variable-length code. This tells 
the decoder which of the six blocks in the mac- 
roblock are coded (“1”) and which are not 
coded (“0”). Table 13.30 lists the codewords 
assigned to the 63 possible combinations. 
There is no code for when none of the blocks 
are coded; this is indicated by the macroblock 
type. For intra-coded macroblocks in P and B 
pictures, the coded block pattern is not trans- 
mitted, but is assumed to be a value of 63 (all 
blocks are coded). For 4:2:2 and 4:4:4 macro- 
blocks, an additional 2 or 6 bits respectively, 
are used to extend coded block pattern. 


Coding 

DCT coefficients of blocks are transformed 
into quantized coefficients and coded in the 
same way they are for P pictures. 


Video Bitstream 


Figure 13.5 illustrates the video bitstream, a 
hierarchical structure with seven layers. From 
top to bottom the layers are: 


Video Sequence 
Sequence Header 
Group of Pictures (GOP) 
Picture 

Slice 

Macroblock (MB) 
Block 


Several extensions may be used to support 
various levels of capability. These extensions 
are: 


Sequence Extension 
Sequence Display Extension 
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Figure 13.5. MPEG 2 Video Bitstream Layer Structures. Marker and reserved bits not shown. 
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Sequence Scalable Extension 
Picture Coding Extension 

Quant Matrix Extension 

Picture Display Extension 

Picture Temporal Scalable Extension 
Picture Spatial Scalable Extension 


If the first sequence header of a video 
sequence is not followed by an extension start 
code (000001B5y), then the video bitstream 
must conform to the MPEG 1 video bitstream. 

For MPEG 2 video bitstreams, an exten- 
sion start code (000001B5,) and a sequence 
extension must follow each sequence header. 


Video Sequence 


Sequence_end_code 
This 32-bit field has a value of 000001B7y and 
terminates a video sequence. 


Sequence Header 


A sequence header should occur about every 
one-half second. The structure is shown in Fig- 
ure 13.5. If not followed by a sequence exten- 
sion, the bitstream conforms to MPEG 1. 


Sequence_header_code 

This 32-bit string has a value of 000001B3y and 
indicates the beginning of a sequence header. 
The sequence_end_code, which terminates the 
video sequence, has a value of 000001B7,, 


Horizontal_size_value 

This is the 12 least significant bits of the width 
(in samples) of the viewable portion of the Y 
component. The two most significant bits of 
the 14bit value are specified in the 
horizontal_size_extension. A value of zero is not 
allowed. 


Vertical_size_value 

This is the 12 least significant bits of the height 
(in scan lines) of the viewable portion of the Y 
component. The two most significant bits of 
the 14bit value are specified in the 
vertical_size_extension. A value of zero is not 
allowed. 


Aspect_ratio_information 
This 4-bit codeword indicates either the sam- 
ple aspect ratio (SAR) or display aspect ratio 
(DAR) as shown in Table 13.9. 

If sequence_display_extension is not 
present, the SAR is determined as follows: 


SAR = DAR (horizontal_size / vertical_size) 


If sequence_display_extension is present, 
the SAR is determined as follows: 


SAR = DAR x (display_horizontal_size / 
display_vertical_size) 


Frame_rate_code 
This 4-bit codeword indicates the frame rate, 
as shown in Table 13.10. 

The actual frame rate is determined as fol- 
lows: 


Srame_rate = frame_rate_value x 
(frame_rate_extension_n + 1) + 
(frame_rate_extension_d + 1) 


When an entry is specified in Table 13.10, 
both frame_rate_extension_n and 
fSrame_rate_extension_d are “00.” If 
progressive_sequence is “1,” the time between 
two frames at the output of the decoder is the 
reciprocal of the frame_rate. If 
progressive_sequence is “0,” the time between 
two frames at the output of the decoder is one- 
half of the reciprocal of the frame_rate. 
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SAR DAR Code 


forbidden forbidden 0000 


1.0000 - 0001 


- 3/4 0010 


- 9/16 0011 


- 1/2.21 0100 


- 0101 


- 0110 


- 0111 


- 1000 


- 1001 


- reserved 1010 


- 1011 


- 1100 


- 1101 


- 1110 


- 1111 


Table 13.9. MPEG 2 aspect_ratio_information Codewords. 


a casa Code 
forbidden 0000 
24/1.001 0001 

24 0010 

25 0011 
30/1.001 0100 

30 0101 

50 0110 
60/1.001 0111 

60 1000 
reserved 1001 
reserved 1010 
reserved 1011 
reserved 1100 
reserved 1101 
reserved 1110 
reserved 1111 


Table 13.10. MPEG 2 frame_rate_code Codewords. 
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Bit_rate_value 

The 18 least significant bits of a 30-bit binary 
number. The 12 most significant bits are in the 
bit_rate_extension. This specifies the bitstream 
bit rate, measured in units of 400 bps, rounded 
upwards. A zero value is not allowed. 


Marker_bit 
Always a “1.” 


Vbv_buffer_size_value 

The 10 least significant bits of a 18-bit binary 
number. The 8 most significant bits are in the 
vbv_buffer_size_extension. Defines the size of 
the Video Buffering Verifier needed to decode 
the sequence. It is defined as: 


B = 16 x 1024 x vbv_buffer_size 


Constrained_parameters_flag 
This bit is set to a “O” since it has no meaning 
for MPEG 2. 


Load_intra_quantizer_matrix 

This bit is set to a “1” if an 
intra_quantizer_matrix follows. If set to a “0,” 
the default values below are used for intra 
blocks (both Y and CbCr) until the next occur- 
rence of a sequence header’ or 
quant_matrix_extension. 


Intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the current values. A value of zero is 
not allowed. The value for intra_quant [0, 0] is 
always 8. These values take effect until the 
next occurrence of a sequence header or 
quant_matrix_extension. For 4:2:2 and 4:4:4 
data formats, the new values are used for both 
the Y and CbCr intra matrix, unless a different 
ChbCr intra matrix is loaded. 


Load_non_intra_quantizer_matrix 

This bit is set to a “1” if a 
non_intra_quantizer_matrix follows. If set to a 
“0,” the default values below are used for non- 
intra blocks (both Y and CbCr) until the next 
occurrence of a sequence header or 
quant_matrix_extension. 


Non_intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the current values. A value of zero is 
not allowed. These values take effect until the 
next occurrence of a sequence header or 
quant_matrix_extension. For 4:2:2 and 4:4:4 
data formats, the new values are used for both 
Y and CbCr non-intra matrix, unless a different 
CbCr non-intra matrix is loaded. 
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User Data 


User_data_start_code 

This optional 32-bit string of 000001B2y indi- 
cates the beginning of user_data. user_data 
continues until the detection of another start 
code. 


User_data 

These n x 8 bits are present only if 
user_data_start_code is present. user_data 
must not contain a string of 23 or more consec- 
utive zero bits. 


Sequence Extension 


A sequence extension may only occur after a 
sequence header. 


Extension_start_code 
This 32-bit string of 000001B5y, indicates the 
beginning of extension data beyond MPEG 1. 


Extension_start_code_ID 

This 4-bit field has a value of “0001” and indi- 
cates the beginning of a sequence extension. 
For MPEG 2 video bitstreams, a sequence 
extension must follow each sequence header. 


Profile_and_level_indication 
This 8-bit field specifies the profile and level, as 
shown in Table 13.11. 


Bit 7: escape bit 
Bits 6-4: profile ID 
Bits 3-0: level ID 


Progressive_sequence 

A “1” for this bit indicates only progressive pic- 
tures are present. A “0” indicates both frame 
and field pictures may be present, and frame 
pictures may be progressive or interlaced. 


Chroma_format 

This 2-bit codeword indicates the CbCr format, 
as shown in Table 13.12. For the ATSC stan- 
dard, the value must be “01.” 


Horizontal_size_extension 

The two most significant bits of horizontal_size. 
For the ATSC standard, the value must be 
“O00.” 


Vertical_size_extension 

The two most significant bits of vertical_size. 
For the ATSC standard, the value must be 
“O00.” 


Bit_rate_extension 

The 12 most significant bits of bit_rate. For the 
ATSC standard, the value must be “0000 0000 
0000.” 


Marker_bit 
Always a “1.” 


vbv_buffer_size_extension 

The eight most _ significant bits of 
vbv_buffer_size. For the ATSC standard, the 
value must be “0000 0000.” 


Low_delay 
A “1” for this bit indicates that no B pictures 
are present, so no frame reordering delay. 


Frame_rate_extension_n 

See frame_rate_code regarding this 2-bit 
binary value. For the ATSC standard, the value 
must be “00.” 


Frame_rate_extension_d 

See frame_rate_code regarding this 5-bit 
binary value. For the ATSC standard, the value 
must be “00000.” 
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EXTENSION EXTENSION PROFILE 


PROGRESSIVE 


HORIZONTAL VERTICAL 


CHROMA BIT RATE 


START START AND SIZE SIZE 
SEQUENCE FORMAT EXTENSION 
CODE CODE ID LEVEL EXTENSION EXTENSION 
VBV BUFFER Low FRAME RATE | FRAME RATE 
SIZE EXTENSION EXTENSION 
DELAY 
EXTENSION N D 


Figure 13.6. MPEG 2 Sequence Extension Structure. Marker bits not shown. 


Sequence Display Extension 


Extension_start_code_ID 

This 4-bit field has a value of “0010” and indi- 
cates the beginning of a sequence display 
extension. Information provided by this exten- 
sion does not affect the decoding process and 
may be ignored. It allows the display of the 
decoded pictures to be as accurate as possible. 


Video_format 
This 3-bit codeword indicates the source of the 
pictures prior to MPEG encoding, as shown in 
Table 13.14. For the ATSC standard, the value 
must be “000.” 


Color_description 

A “1” for this bit indicates that color_primaries, 
transfer_characteristics, and matrix_coefficients 
are present in the bitstream. 


Color_primaries 

This optional 8-bit codeword describes the 
chromaticity coordinates of the source prima- 
ries, as shown in Table 13.15. If 
sequence_display_extension is not present, or 
color_description = “0,” the indicated default 
value must be used. 


This information may be used to adjust the 
color processing after MPEG 2 decoding to 
compensate for the color primaries of the dis- 


play. 


Transfer_characteristics 
This optional 8-bit codeword describes the 
optoelectronic transfer characteristic of the 
source picture, as shown in Table 13.16. If 
sequence_display_extension is not present, or 
color_description = “0,” the indicated default 
value must be used. 

This information may be used to adjust the 
processing after MPEG 2 decoding to compen- 
sate for the gamma of the display. 


Matrix_coefficients 
This optional 8-bit codeword describes the 
coefficients used in deriving YCbCr from 
R’G'B’, as shown in Table 13.17. If 
sequence_display_extension is not present, or 
color_description = “0,” the indicated default 
value must be used. 

This information is used to select the 
proper YCbCr-to-RGB matrix, if needed, after 
MPEG 2 decoding. 
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Profile Profile ID Code Level Level ID Code 
reserved 000 reserved 0000 
high 001 reserved 0001 
spatial scalable 010 reserved 0010 
SNR scalable 011 reserved 0011 
main 100 high 0100 
simple 101 reserved 0101 
reserved 110 high 1440 0110 
reserved 111 reserved 0111 

main 1000 
reserved 1001 
low 1010 
reserved 1011 
reserved 1100 
reserved 1101 
reserved 1110 
reserved 1111 
Table 13.11. MPEG 2 profile_and_level_indication Codewords. 


Chroma Format Code 
reserved 00 
4:2:0 01 
4:2:2 10 
4:4:4 11 


Table 13.12. MPEG 2 chroma_format Codewords. 
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EXTENSION 
VIDEO COLOR COLOR 
START 
FORMAT DESCRIPTION | PRIMARIES 
CODE ID 


TRANSFER 
CHARACTERISTICS 


DISPLAY 
HORIZONTAL 


DISPLAY 


MATRIX 
VERTICAL 


COEFFICIENTS 


SIZE SIZE 


Figure 13.7. MPEG 2 Sequence Display Extension Structure. 


Display_horizontal_size 
See display_vertical_size regarding this 14-bit 
binary number. 


Marker_bit 
Always a “1.” 


Display_vertical_size (14 bits) 

This 14-bit binary number, in conjunction with 
display_horizontal_size, defines the active 
region of the display. If the display region is 
smaller than the encoded picture size, only a 
portion of the picture will be displayed. If the 
display region is larger than the picture size, 
the picture will be displayed on a portion of the 
display. 


Sequence Scalable Extension 


Extension_start_code_ID 

This 4-bit field has a value of “0101” and indi- 
cates the beginning of a sequence scalable 
extension. This extension specifies the scal- 
ability modes implemented for the video bit- 
stream. If sequence_scalable_extension is not 
present in the bitstream, no scalability is used. 
The base layer of a scalable hierarchy does not 
have a sequence_scalable_extension, except in 
the case of data partitioning. 


Scalable_mode 

This 2-bit codeword indicates the scalability 
type of the video sequence as shown in Table 
13.13. 


Scalable Mode Code 
data partitioning 00 
spatial scalability 01 
SNR scalability 10 
temporal scalability 11 


Table 13.13. MPEG 2 scalable_mode 
Codewords. 


Layer_ID 

This 4-bit binary number identifies the layers 
in a scalable hierarchy. The base layer has an 
ID of “0000.” During data partitioning, 
layer_ID “0000” is assigned to partition layer 
zero and layer_ID “0001” is assigned to parti- 
tion layer one. 


Lower_layer_prediction_horizontal_size 
This optional 14-bit binary number is present 
only if scalable_mode = “01.” It indicates the 
horizontal size of the lower layer frame used 
for prediction. It contains the value of 
horizontal_size in the lower layer bitstream. 
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sane pais 
component 000 
PAL 001 
NTSC 010 
SECAM 011 
MAC 100 
unspecified 101 
reserved 110 
reserved 111 


Table 13.14. MPEG 2 video_format Codewords. 


Color Primaries Code Application Default 
forbidden 0000 0000 
MPEG 2, ATSC, 
BT.709, SMPTE 274M 0000 0001 DVB 25Hz HDTV, 
DVB 30Hz HDTV 
unspecified 0000 0010 
reserved 0000 0011 
BT.470 system M 0000 0100 
BT.470 system B, G, I 0000 0101 DVB 25Hz SDTV 
SMPTE 170M 0000 0110 DVB 30Hz SDTV 
SMPTE 240M 0000 0111 
reserved 0000 1000 
reserved 1111 1111 


Table 13.15. MPEG 2 color_primaries Codewords. 
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Opto-Electronic 


Transfer Characteristics Code Application Default 
forbidden 0000 0000 
MPEG 2, ATSC, 
BT.709, SMPTE 274M 0000 0001 DVB 25Hz HDTV, 
DVB 30Hz HDTV 
unspecified 0000 0010 
reserved 0000 0011 
BT.470 system M 0000 0100 
BT.470 system B, G, I 0000 0101 DVB 25Hz SDTV 
SMPTE 170M 0000 0110 DVB 30Hz SDTV 
SMPTE 240M 0000 0111 
linear 0000 1000 
reserved 0000 1001 
reserved 1111 1111 


Table 13.16. MPEG 2 transfer_characteristics Codewords. 


Color Primaries Code Application Default 
forbidden 0000 0000 
MPEG 2, ATSC, 
BT.709, SMPTE 274M 0000 0001 DVB 25Hz HDTV, 
DVB 30Hz HDTV 
unspecified 0000 0010 
reserved 0000 0011 
FCC 0000 0100 
BT.470 system B, G, I 0000 0101 DVB 25Hz SDTV 
SMPTE 170M 0000 0110 DVB 30Hz SDTV 
SMPTE 240M 0000 0111 
reserved 0000 1000 
reserved 11111111 


Table 13.17. MPEG 2 matrix_coefficients Codewords. 
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Figure 13.8. MPEG 2 Sequence Scalable Extension Structure. Marker bits not shown. 


Marker_bit 
Always a “1.” It is present only if scalable_mode 
= “Ol.” 


Lower_layer_prediction_vertical_size 

This optional 14-bit binary number is present 
only if scalable_mode = “01.” It indicates the 
vertical size of the lower layer frame used for 
prediction. It contains the value of vertical_size 
in the lower layer bitstream. 


Horizontal_subsampling_factor_m 

This optional 5-bit binary number is present 
only if scalable_mode = “01,” and affects the 
spatial upsampling process. A value of “00000” 
is not allowed. 


Horizontal_subsampling_factor_n 

This optional 5-bit binary number is present 
only if scalable_mode = “01,” and affects the 
spatial upsampling process. A value of “00000” 
is not allowed. 


Ver tical_subsampling_factor_m 

This optional 5-bit binary number is present 
only if scalable_mode = “01,” and affects the 
spatial upsampling process. A value of “00000” 
is not allowed. 


Vertical_subsampling_factor_n 

This optional 5-bit binary number is present 
only if scalable_mode = “01,” and affects the 
spatial upsampling process. A value of “00000” 
is not allowed. 


Picture_mux_enable 

This optional 1-bit field is present only if 
scalable_mode = “11.” If set to a “1,” the 
picture_mux_order and_ picture_mux_factor 
parameters are used for remultiplexing prior to 
display. 


Mux _to_progressive_sequence 

This optional 1-bit field is present only if 
scalable_mode = “11” and picture_mux_enable = 
“1.” If set to a “1,” it indicates the decoded pic- 
tures are to be temporally multiplexed to gen- 
erate a progressive sequence for display. 
When temporal multiplexing is to generate an 
interlaced sequence, this flag is a “0.” 


Picture_mux_order 

This optional 3-bit binary number is present 
only if scalable_mode = “11.” It specifies the 
number of enhancement layer pictures prior to 
the first base layer picture. It is used to assist 
the decoder in properly remultiplexing pic- 
tures prior to display. 
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Picture_mux_factor 

This optional 3-bit binary number is present 
only if scalable_mode = “11.” It denotes the 
number of enhancement layer pictures 
between consecutive base layer pictures, and 
is used to assist the decoder in properly remul- 
tiplexing pictures prior to display. 


Group of Pictures (GOP) Layer 


A GOP header should occur about every 2 sec- 
onds. Data for each group of pictures consists 
of a GOP header followed by picture data. The 
structure is shown in Figure 13.5. The DVD 
standard uses user data extensions at this 
layer for closed captioning data. 


Group_start_code 
This 32-bit string has a value of 000001B8y and 
indicates the beginning of a group of pictures. 


Time_code (25 bits) 

These 25 bits indicate timecode information, as 
shown in Table 13.19. [drop_frame_flag] may 
be set to “1” only if the picture rate is 30/1.001 
(29.97) Hz. 


Closed_gop 

This 1-bit flag is set to “1” if the group of pic- 
tures has been encoded without motion vec- 
tors referencing the previous group of 
pictures. This bit allows support of editing the 
compressed bitstream. 


Broken_link 

This 1-bit flag is set to a “O” during encoding. It 
is set to a “1” during editing when the B frames 
following the first I frame of a group of pictures 
cannot be correctly decoded. 


Picture Layer 


Data for each picture consists of a picture 
header followed by slice data. The structure is 
shown in Figure 13.5. If a sequence extension 
is present, each picture header is followed by a 
picture coding extension. 

Some implementations enable frame-accu- 
rate switching of aspect ratio information via 
user data extensions at this layer. The ATSC 
DTV standard also uses user data extensions 
at this layer for EIA-708 closed captioning data. 


Picture_start_code 
This 32-bit string has a value of 00000100y. 


Temporal_reference 

For the first frame in a GOP the 10-bit binary 
number temporal_reference is zero. It is then 
incremented by one, modulo 1024, for each 
frame in the display order. When a frame is 
coded as two fields, the temporal reference of 
both fields is the same. 


Picture_coding_type 

This 3-bit codeword indicates the picture type 
(I picture, P picture, or B picture) as shown in 
Table 13.18. 


Picture Type Code 
forbidden 000 
I picture 001 
P picture 010 
B picture 011 
forbidden 100 
reserved 101 
reserved 110 
reserved 111 


Table 13.18. MPEG 2 picture_coding_type 
Codewords. 
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Vbv_delay 

For constant bit rates, this 16-bit binary num- 
ber sets the initial occupancy of the decoding 
buffer at the start of decoding a picture so that 
it doesn’t overflow or underflow. 


Full_pel_forward_vector 

This optional 1-bit field is not used for MPEG 
2, so has a value of “0.” It is present only if 
picture_coding_type = “010” or “011.” 


Forward_f_code 

This optional 3-bit field is not used for MPEG 
2, so has a value of “111.” It is present only if 
picture_coding_type = “010” or “011.” 


Full_pel_backward_vector 

This optional 1-bit field is not used for MPEG 
2, so has a value of “0.” It is present only if 
picture_coding_type = “011.” 


Backward_f_code 

This optional 3-bit field is not used for MPEG 
2, so has a value of “111.” It is present only if 
picture_coding_type = “011.” 


Extra_bit_picture 
A bit which, when set to “1,” indicates that 
extra_information_picture follows. 


Extra_information_picture 

If extra_bit_picture = “1,” then these 9 bits fol- 
low consisting of 8 bits of data 
(extra_information_picture) and then another 
extra_bit_picture to indicate if a further 9 bits 
follow, and so on. 


Picture Coding Extension 


A picture coding extension may only occur fol- 
lowing a picture header. 


Extension_start_code 
This 32-bit string of 000001B5y, indicates the 
beginning of a new set of extension data. 


Extension_start_code_ID 

This 4-bit field has a value of “1000” and indi- 
cates the beginning of a picture coding exten- 
sion. 


: Range Number of 
Fine Cote of waite Bits 
drop_frame_flag 1 
time_code_hours 0-23 5 
time_code_minutes 0-59 6 
marker_bit 1 1 
time_code_seconds 0-59 6 
time_code_pictures 0-59 6 


Table 13.19. MPEG 2 time_code Field. 
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Figure 13.9. MPEG 2 Picture Coding Extension Structure. Marker bits not shown. 


f_code [0,0] 

A 4bit binary number, having a range of 
“0001” to “1001,” that is used for the decoding 
of forward horizontal motion vectors. A value 
of “0000” is not allowed; a value of “1111” indi- 
cates this field is ignored. 


f_code [0,1] 

A 4bit binary number, having a range of 
“0001” to “1001,” that is used for the decoding 
of forward vertical motion vectors. A value of 
“0000” is not allowed; a value of “1111” indi- 
cates this parameter is ignored. 


f_code [1,0] 

A 4bit binary number, having a range of 
“0001” to “1001,” that is used for the decoding 
of backward horizontal motion vectors. A value 
of “0000” is not allowed; a value of “1111” indi- 
cates this field is ignored. 


f_code [1,1] 

A 4-bit code, having a range of “0001” to 
“1001,” that is used for the decoding of back- 
ward vertical motion vectors. A value of “0000” 
is not allowed; a value of “1111” indicates this 
field is ignored. 


Intra_dc_precision 
This 2-bit codeword specifies the intra DC pre- 
cision as shown in Table 13.20. 


Intra "6 aes Code 
8 00 
9 01 
10 10 
11 11 


Table 13.20. MPEG 2 intra_dc_precision 
Codewords. 
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Picture_structure 
This 2-bit codeword specifies the picture struc- 
ture as shown in Table 13.21. 


structs Code 
reserved 00 
top field 01 
bottom field 10 
frame picture 11 


Table 13.21. MPEG 2 picture_structure 
Codewords. 


Top_field_first 

If progressive_sequence = “O,” this bit indicates 
what field is output first by the decoder. In a 
field, this bit has a value of “0.” In a frame, a “1” 
indicates the first field of the decoded frame is 
the top field. A value of “0” indicates the first 
field is the bottom field. 

If progressive_sequence = “1” and 
repeat_first_field = “0,” this bit is a “O” and the 
decoder generates a progressive frame. 

If progressive_sequence = Bt 
repeat_first_field = “1,” and this bit is a “0,” the 
decoder generates two identical progressive 
frames. 

If progressive_sequence = a 
repeat_first_field = “1,” and this bit is a “1,” the 
decoder generates three identical progressive 
frames. 


Frame_pred_frame_dct 

If this bit is a “1,” only frame-DCT and frame 
prediction are used. For field pictures, it is 
always a “0.” This parameter is a “1” if 
progressive_frame is “1.” 


Concealment_motion_vectors 
If this bit is a “1,” it indicates that the motion 
vectors are coded for intra macroblocks. 


Q_scale_type 

This bit indicates which of two mappings 
between quantizer_scale_code and 
quantizer_scale are used by the decoder. 


Intra_vlc_format 

This bit indicates which table is to be used for 
DCT coefficients for intra blocks. Table 13.34 
is used when intra_vlc_format = “0.” Table 
13.35 is used when intra_vic_format = “1.” For 
non-intra blocks, Table 13.34 is used regard- 
less of the value of intra_vlc_format. 


Alternate_scan 

This bit indicates which zigzag scanning pat- 
tern is to be used by the decoder for transform 
coefficient data. “0” = Figure 7.50; “1” = Figure 
7.51. 


Repeat_first_field 
See top_field_first for the use of this bit. For 
field pictures, it has a value of “0.” 


Chroma_420_type 
If chroma_format is 4:2:0, this bit is the same as 
progressive_frame. Other wise, it is a “0.” 


Progressive_frame 

If a “0,” this bit indicates the two fields of the 
frame are interlaced fields, with a time interval 
between them. If a “1,” the two fields of the 
frame are from the same instant in time. 


Composite_display_flag 

This bit indicates whether or not v_axis, 
field_sequence, sub_carrier, burst_amplitude, 
and sub_carrier_phase are present in the bit- 
stream. 
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V_axis 
This bit is present only when 
composite_display_flag = “1.” It is used when 
the original source was a PAL video signal. 
v_axis = “1” on a positive V sign, “0” other wise. 
This information can be obtained from an 
NTSC/PAL decoder that is driving the MPEG 
2 encoder. 


Field_sequence 

This 3-bit codeword is present only when 
composite_display_flag = “1.” It specifies the 
number of the field in the original four- or 
eight-field sequence as shown in Table 13.22. 


Frame Field 
Sequence Sequence cous 
1 1 000 
1 2 001 
2 3 010 
2 4 011 
3 5 100 
3 6 101 
4 7 110 
4 8 111 


Table 13.22. MPEG 2 field_sequence 
Codewords. 


This information can be obtained from an 
NTSC/PAL decoder that is driving the MPEG 
2 encoder. It can be used to enable a MPEG 2 
decoder to set the field sequence of a NTSC/ 
PAL encoder to the same as the original. 


Sub_carrier 
This bit is present only when 
composite_display_flag = “1.” A “0” indicates 
that the original subcarrier-to-line frequency 
relationship was correct. 

This information can be obtained from the 
NTSC/PAL decoder that is driving the MPEG 
2 encoder. 


Burst_amplitude 
This 7-bit binary number is present only when 
composite_display_flag = “1.” It specifies the 
original PAL or NTSC burst amplitude when 
quantized per BT.601 (ignoring the MSB). 
This information can be obtained from an 
NTSC/PAL decoder that is driving the MPEG 
2 encoder. It can be used to enable a MPEG 2 
decoder to set the color burst amplitude of a 
NTSC/PAL encoder to the same as the origi- 
nal. 


Sub_carrier_phase 

This 8-bit binary number is present only when 
composite_display_flag = “1.” It specifies the 
original PAL or NTSC subcarrier phase as 
defined in BT-470. The value is defined as: 
(360° / 256) x sub_carrier_phase. 

This information can be obtained from an 
NTSC/PAL decoder that is driving the MPEG 
2 encoder. It can be used to enable a MPEG 2 
decoder to set the color subcarrier phase of a 
NTSC/PAL encoder to the same as the origi- 
nal. 
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Quant Matrix Extension 


Each quantization matrix has default values. 
When a sequence header is decoded, all matri- 
ces reset to their default values. User-defined 
matrices may be downloaded during a 
sequence header or using this extension. 


Extension_start_code_ID 

This 4-bit string has a value of “0011” and indi- 
cates the beginning of a 
quant_matrix_extension. This extension allows 
quantizer matrices to be transmitted for the 
4:2:2 and 4:4:4 chroma formats. 


Load_intra_quantizer_matrix 

This bit is set to a “1” if an 
intra_quantizer_matrix follows. If set to a “0,” 
the default values below are used for intra 
blocks until the next occurrence of a sequence 
header or guant_matrix_extension. 


Intra_quantizer_matrix 
An optional list of sixty-four 8-bit values that 
replace the default values shown above. A 
value of zero is not allowed. The value for 
intra_quant [0, 0] is always 8. These values 
take effect until the next occurrence of a 
sequence header or quant_matrix_extension. 
The order follows that shown in Figure 7.50. 
For 4:2:2 and 4:4:4 data formats, the new 
values are used for both the Y and CbCr intra 
matrix, unless a different CbCr intra matrix is 
loaded. 


Load_non_intra_quantizer_matrix 

This bit is set to a “l” if a 
non_intra_quantizer_matrix follows. If set to a 
“0,” the default values below are used for non- 
intra blocks until the next occurrence of a 
sequence header or quant_matrix_extension. 


Non-intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the default values shown above. A 
value of zero is not allowed. These values take 
effect until the next occurrence of a sequence 
header or quant_matrix_extension. The order 
follows that shown in Figure 7.50. 

For 4:2:2 and 4:4:4 data formats, the new 
values are used for both the Y and CbCr non- 
intra matrix, unless a new CbCr non-intra 
matrix is loaded. 


Load_chroma_intra_quantizer_matrix 
This bit is set to a “l” if a 
chroma_intra_quantizer_matrix follows. If set 
to a “0,” there is no change in the values used. 
If chroma_format is 4:2:0, this bit is a “0.” 


Chroma_intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the previous or default values used for 
CbCr data. A value of zero is not allowed. The 
value for chroma_intra_quant [0,0] is always 8. 
These values take effect until the next occur- 
rence of a sequence header’ or 
quant_matrix_extension. The order follows that 
shown in Figure 7.50. 
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Figure 13.10. MPEG 2 Quant Matrix Extension Structure. Marker bits not shown. 


Load_chroma_non_intra_quantizer_matrix 
This bit is set to a “1” if a 
chroma_non_intra_quantizer_matrix follows. If 
set to a “0,” there is no change in the values 
used. If chroma_format is 4:2:0, this bit is a “0.” 


Chroma_non_intra_quantizer_matrix 

An optional list of sixty-four 8-bit values that 
replace the previous or default values used for 
CbCr data. A value of zero is not allowed. 
These values take effect until the next occur- 
rence of a sequence header’ or 
quant_matrix_extension. The order follows that 
shown in Figure 7.50. 


Picture Display Extension 


This extension allows the position of the dis- 
play rectangle to be moved on a picture-by-pic- 
ture basis. A typical application would be 
implementing pan-and-scan. 


Extension_start_code_ID 

This 4-bit field has a value of “0111” and indi- 
cates the beginning of a picture display exten- 
sion. 

In the case of an interlaced sequence, a pic- 
ture may relate to one, two, or three decoded 
fields. Thus, there may be up to three sets of 
the following four fields present in the bit- 
stream. 
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Frame_center_horizontal_offset 

This 16-bit binary number specifies the hori- 
zontal offset in units of 1/16th of a sample. A 
positive value positions the center of the 
decoded picture to the right of the center of 
the display region. 


Marker_bit 
Always a “1.” 


Frame_center_vertical_offset 

This 16-bit binary number specifies the verti- 
cal offset in units of 1/16th of a scan line. A 
positive value positions the center of the 
decoded picture below the center of the dis- 
play region. 


Marker_bit 
Always a “1.” 


Picture Temporal Scalable Extension 


Extension_start_code_ID 
This 4-bit value of “1010” indicates the begin- 
ning ofa picture temporal scalable extension. 


Reference_select_code 


Forward_temporal_reference 
This 10-bit binary number indicates the tempo- 
ral reference of the lower layer to be used to 
provide the forward prediction. If more than 10 
bits are required to specify the temporal refer- 
ence, only the 10 LSBs are used. 


Marker_bit 
Always a “1.” 


Backward_temporal_reference 

This 10-bit binary number indicates the tempo- 
ral reference of the lower layer to be used to 
provide the backward prediction. If more than 
10 bits are required to specify the temporal ref- 
erence, only the 10 LSBs are used. 


Picture Spatial Scalable Extension 


Extension_start_code_ID 
This 4-bit value of “1001” indicates the begin- 
ning of a picture spatial scalable extension. 


Lower_layer_temporal_reference 

This 10-bit binary number indicates the tempo- 
ral reference of the lower layer to be used to 
provide the prediction. If more than 10 bits are 


This 2-bit codeword identifies reference  ‘equired to specify the temporal reference, 
frames or fields for prediction. only the 10 LSBs are used. 
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Figure 13.11. MPEG 2 Picture Display Extension Structure. Marker bits not shown. 
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Marker_bit 
Always a “1.” 


Lower_layer_horizontal_offset 

This 15-bit binary number indicates the hori- 
zontal offset of the top-left corner of the 
upsampled lower layer picture relative to the 
enhancement layer picture. This parameter 
must be an even number for the 4:2:0 and 4:2:2 
formats. 


Marker_bit 
Always a “1.” 


Lower_layer_vertical_offset 
This 15-bit binary number indicates the verti- 
cal offset of the top-left corner of the upsam- 


pled lower layer picture relative to the 
enhancement layer picture. This parameter 
must be an even number for the 4:2:0 format. 


Spatial_temporal_weight_code_table_index 
This 2-bit codeword indicates which spatial 
temporal weight codes are to be used. 


Lower_layer_progressive_frame 
This bit is “1” if the lower layer picture is pro- 
gressive. 


Lower_layer_deinterlaced_field_select 
This bit is used in conjunction with other 
parameters to assist the decoder. See Table 
13.23. 
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Figure 13.12. MPEG 2 Picture Temporal Scalable Extension Structure. Marker bits not shown. 
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Figure 13.13. MPEG 2 Picture Spatial Scalable Extension Structure. Marker bits not shown. 
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Lower Layer Lower Layer ei Apply 
Deinterlaced Progressive Picereselve Deinterlace sae For 
i Frame Prediction 
Field Select Frame Process 
0 0 1 yes top field 
1 0 1 yes bottom field 
1 1 1 no frame 
1 1 0 no frame 
1 0 0 yes both fields 


Table 13.23. MPEG 2 Picture Spatial Scalable Extension Upsampling Process. 


Slice Layer 


Data for each slice layer consists of a slice 
header followed by macroblock data. The 
structure is shown in Figure 13.5. 


Slice_start_code 

The first 24 bits have a value of 000001y. The 
last eight bits are slice_vertical_position, and 
have a value of 01q—-AFy. 

The slice_vertical_position specifies the 
vertical position in macroblock units of the 
first macroblock in the slice. The 
slice_vertical_position of the first row of mac- 
roblocks is one. 


Slice_vertical_position_extension 

This optional 3-bit binary number represents 
the three MSBs of an 11-bit 
slice_vertical_position value if the vertical size 
of the frame is >2800 lines. If the vertical size 
of the frame is <2800 lines, this field is not 
present. 


Priority_breakpoint 

This optional 7-bit binary number is present 
only when _ sequence_scalable_extension is 
present in the bitstream and scalable_mode = 
data partitioning. It specifies where in the bit- 
stream to partition. 


Quantizer_scale_code 

This 5-bit binary value has a value of 1 to 31 (a 
value of zero is forbidden). It specifies the 
scale factor of the reconstruction level of the 
received DCT coefficients. The decoder uses 
this value until another quantizer_scale_code is 
received at either the slice or macroblock 
layer. 


Intra_slice_flag 
If this bit is set to a “1,” intra_slice and 
reserved_bits data follows. 


Intra_slice 

This bit is present only if intra_slice_flag = “1.” 
It must be set to a “0” if any macroblocks in the 
slice are non-intra macroblocks. 
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Reser ved_bits 

These seven bits are present only if 
intra_slice_flag = “1.” These bits are always 
“0000000.” 


Extra_bit_slice 
A bit which, when set to “1,” indicates that 
extra_information_slice follows. 


Extra_information_slice 

If extra_bit_slice = “1” and intra_slice_flag = “1,” 
then these 9 bits follow consisting of 8 bits of 
data  (extra_information_slice) and then 
another extra_bit_slice to indicate if a further 9 
bits follow, and so on. 


Macroblock Layer 


Data for each macroblock layer consists of a 
macroblock header followed by motion vector 
and block data. The structure is shown in Fig- 
ure 13.5. 


Macroblock_escape 

This optional 11-bit field is a fixed bit string of 
“0000 0001 000” and is used when the differ- 
ence between the current macroblock address 
and the previous macroblock address is 
greater than 33. It forces the value of 
macroblock_address_increment to be increased 
by 33. Any number of consecutive 
macroblock_escape fields may be used. 


Macroblock_address_increment 

This is a variable-length codeword that speci- 
fies the difference between the current mac- 
roblock address and the previous macroblock 
address. It has a maximum value of 33. Values 
greater than 33 are encoded using the 
macroblock_escape field. The variable-length 
codes are listed in Table 13.24. 


Macroblock_type 

This variable-length codeword indicates the 
method of coding and macroblock content 
according to Tables 13.25, 13.26, and 13.27. 


Spatial_temporal_weight_code 

This optional 2-bit codeword indicates, in the 
case of spatial scalability, how the spatial and 
temporal predictions are combined to do the 
prediction for the macroblock. This field is 
present only if the [spatial temporal weight 
class] = 1 in Tables 13.25, 13.26, and 13.27, and 
spatial_temporal_weight_code_table_index # 
“00.” 


Frame_motion_type 

This optional 2-bit codeword indicates the mac- 
roblock motion prediction, as shown in Table 
13.28. It is present only if picture_structure = 
frame, frame_pred_frame_dct = “0” and [motion 
for ward] or [motion backward] = “1” in Tables 
13.25, 13.26, and 13.27. 


Field_motion_type 

This optional 2-bit codeword indicates the mac- 
roblock motion prediction, as shown in Table 
13.29. It is present only if [motion forward] or 
[motion backward] = “1” in Tables 13.25, 13.26, 
and 13.27 and frame_motion_type is not 
present. 


Det_type 

This optional bit indicates whether the mac- 
roblock is frame or field DCT coded. “1” = 
field, “0” = frame. It is present only if 
picture_structure = “11,” frame_pred_frame_dct 
= “0,” and [macroblock intra] or [coded pat- 
tern] = “1” in Tables 13.25, 13.26, and 13.27. 
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meemert | coi | Meemert | cat 

1 1 17 0000 0101 10 
2 011 18 0000 0101 01 
3 010 19 0000 0101 00 
4 0011 20 0000 0100 11 
5 0010 21 0000 0100 10 
6 0001 1 22 0000 0100 011 
7 0001 0 23 0000 0100 010 
8 0000 111 24 0000 0100 001 
9 0000 110 25 0000 0100 000 
10 0000 1011 26 0000 0011 111 
11 0000 1010 27 0000 0011 110 
12 0000 1001 28 0000 0011 101 
13 0000 1000 29 0000 0011 100 
14 0000 0111 30 0000 0011 011 
15 0000 0110 31 0000 0011 010 
16 0000 0101 11 32 0000 0011 001 
33 0000 0011 000 
macroblock_escape 0000 0001 000 


Table 13.24. MPEG 2 Variable-Length Code Table for macroblock_address_increment. 
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§ z g < 3 ro g = 0 
Ss S = 5 £ oi S_ io 
o = & s 2 fo S'5 8 
s $ 8 S g eo eee 
Type 3 = 0 a. 8 Fo | 32# Code 
2 3 s 3 = St = ow 
S 2 & 3S © =) wD ero 
8 $ 8 3 2 as | 5 = 
5 = = = a2 |e 
intra 0 0 1 0 1 
intra, quant 0 01 
I Pictures with Spatial Scalability 
coded, 
compatible 0 0 0 q 0 : t 
coded, 
compatible, 1 0 0 1 0 0 4 01 
quant 
intra 0 0 0 0 1 0 0 0011 
intra, quant 1 0 0 0 1 0 0 0010 
not coded, 
compatible 0 0 0 0 0 0 4 0001 
I Pictures with SNR Scalability 
coded 0 0 0 1 0 0 0 1 
coded, quant 1 0 0 1 0 0 0 01 
not coded 0 0 0 0 0 0 0 001 


Table 13.25. MPEG 2 Variable-Length Code Table for | Picture macroblock_type. 


596 = Chapter 13: MPEG 2 


F J = = 3 ae |S 

Ss CS 3 5 2 ok C_ 

So Fe & P= 2 2 6 © 

s $ 8 S g se | 232 

Type r) = a a. Hy Fo | 32# Code 

2 6 s 3 = Sz = ow 

2 ed 2 3 © E=i7y ero 

8 : 8 3 2 as | 5 = 

s = = = az |e 
mc, coded 0 1 0 1 0 0 0 1 
no mc, coded 0 0 0 1 0 0 0 01 
mc, not coded 0 1 0 0 0 0 0 001 
intra 0 0 0 0 1 0 0 0001 1 
me, coded, 1 1 0 1 0 0 0 0001 0 
quant 
no me, coded, 1 0 0 1 0 0 0 0000 1 
quant 
intra, quant 1 0 0 0 1 0 0 0000 01 

P Pictures with SNR Scalability 

coded 0 0 0 1 0 0 0 1 
coded, quant 1 0 0 1 0 0 0 01 
not coded 0 (0) 0 0 0 0 0 001 


Table 13.26a. MPEG 2 Variable-Length Code Table for P Picture macroblock_type. 
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Pr) 

5 =) z S os? ‘s 

§ 5 g 5 8 sa | %_% 

Co = <4 s 2 2 6 © 

s $ 8 S g ey | Seo 

Type 3 = oO a. 8 Fo | 32# Code 

2 6 s 3 = st ow 

2 =} 2 3 © wy ero 

8 S 8 3 2 ao | 5 = 

s = = £ n= & 

P Pictures with Spatial Scalability 

mc, coded 0 1 0 1 0 0 0 10 
mc, coded, 
compatible 0 1 0 pf 0 1 1,2,3 011 
no mc, coded 0 0 0 1 0 0 0 0000 100 
nD They Coded; 0 0 0 1 0 1 1,2,3 0001 11 
compatible 
mc, not coded 0 1 0 0 0 0 0 0010 
intra 0 0 0 0 1 0 0 0000 111 
micono cones 0 1 0 0 0 1 Log 0011 
compatible 
mae coded) 1 1 0 1 0 0 0 010 
quant 
HONE) CONE 1 0 0 1 0 0 0 0001 00 
quant 
intra, quant 1 0 0 0 1 0 0 0000 110 
mc, coded, 
compatible, quant : 1 0 ‘ u 7 1,2,3 ve 
np mcs coded, 1 0 0 1 0 1 128 0001 01 
compatible, quant 
aanG; not coded, 0 0 0 0 0 1 1,2,3 0001 10 
compatible 
coded, compatible 0 0 0 1 0 0 4 0000 101 
coded; compatible, 1 0 0 1 0 0 4 0000 010 
quant 
not coded, 0 0 0 0 0 0 4 0000 0011 
compatible 


Table 13.26b. MPEG 2 Variable-Length Code Table for P Picture macroblock_type. 


598 = Chapter 13: MPEG 2 


5 J i. e 3 eei/s, 

Ss cS 3 5 2 ou C_ 

oC s & » 2 Qo 2% & 

x 5 3 G a Es | %55 

Type 8 Fa a a. 8 Fo | 322 Code 

2 6 s 3 = SE =o 

2 2 2 3 © =) oD ero 

8 : 8 3 2 ao |5 = 

s = = £ n= & 
interp, not coded 0 1 1 0 0 0 0 10 
interp, coded 0 1 1 1 0 0 0 11 
bwd, not coded 0 0 1 0 0 0 0 010 
bwd, coded 0 0 1 1 0 0 0 011 
fwd, not coded 0 1 0 0 0 0 0 0010 
fwd, coded 0 1 0 1 0 0 0 0011 
intra 0 0 0 0 1 0 0 0001 1 
intra, coded, quant 1 1 1 1 0 0 0 0001 0 
fwd, coded, quant 1 1 0 1 0 0 0 0000 11 
bwd, coded, quant 1 0 1 1 0 0 0 0000 10 
intra, quant 1 0 0 0 1 0 0 0000 01 

B Pictures with Spatial Scalability 

interp, not coded 0 1 1 0 0 0 0 10 
interp, coded 0 1 1 1 0 0 0 11 
bwd, not coded 0 0 1 0 0 0 0 010 
bwd, coded 0 0 1 1 0 0 0 011 
fwd, not coded 0 1 0 0 0 0 0 0010 
fwd, coded 0 1 0 1 0 0 0 0011 
bid mae comes 0 0 1 0 0 1 123 0001 10 
compatible 
bwd, coded, compatible 0 0 1 1 0 1 1,2,3 0001 11 
awd, ot coded: 0 1 0 0 0 1 1,2,3 0001 00 
compatible 
fwd, coded, compatible 0 1 0 1 0 1 1,2,3 0001 01 
intra 0 0 0 0 1 0 0 0000 110 


Table 13.27a. MPEG 2 Variable-Length Code Table for B Picture macroblock_type. 
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F 2 z , $ | t2#/3 , 

s G 2 5 2 ol G_ 

Co = Zz » 2 Qo 2% & 

x 5 8 & S 5 3 950 

Type 8 = a a. 8 Fo | 32# Code 

3 5 < 3 = se Eom 

¢ 3 s 3 S 2m | EFS 

g Z 3S 8 2 ao | 5 = 

Ss = 5 = ns e 

B Pictures with Spatial Scalability (continued) 
interp, coded, 1 1 1 1 0 0 0 0000 111 
quant 
fwd, coded, 1 1 0 1 0 0 0 0000 100 
quant 
bwd, coded, 1 0 1 1 0 0 0 0000 101 
quant 
intra, quant 1 0 0 0 1 0 0 0000 0100 
fwd, coded, 
compatible, 1 1 0 1 0 1 1,2,3 0000 0101 
quant 
bwd, coded, 
compatible, 1 0) 1 1 0 1 1, 2,3 0000 0110 0 
quant 
not coded, 0 0 0 0 0 0 4 0000 01110 
compatible 
coded, quant, 1 0 0 1 0 0 4 0000 01101 
compatible 
coded, ; 0 0 0 1 0 0 4 0000 01111 
compatible 
B Pictures with SNR Scalability 

coded 0 0 1 0 1 
coded, quant 0 1 0 01 
not coded 0 0 0 0 001 


Table 13.27b. MPEG 2 Variable-Length Code Table for B Picture macroblock_type. 
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Spatial 


Temporal Prediction Motion Motion 
ee ht Tine Vector Vector Code 
B yp Count Format 
Class 
reserved 00 
0,1 field 2 field 01 
2,3 field 1 field 01 
0,1, 2,3 frame 1 frame 10 
0, 2,3 dual prime 1 field 11 
Table 13.28. MPEG 2 frame_motion_type Codewords. 
Spatlal an Motion Motion 
Temporal Prediction 
Weight Toe Vector Vector Code 
e yp Count Format 
Class 
reserved 00 
0,1 field 1 field 01 
0,1 16x 8 mc 2 field 10 
0 dual prime 1 field 11 


Table 13.29. MPEG 2 field_motion_type Codewords. 
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Quantizer_scale_code 

This optional 5-bit binary number has a value 
of 1-31 (a value of zero is forbidden). It speci- 
fies the scale factor of the reconstruction level 
of the received DCT coefficients. The decoder 
uses this value until another 
quantizer_scale_code is received. This field is 
present only when [macroblock quant] = “1” in 
Tables 13.25, 13.26, and 13.27. 


Optional Motion Vectors 


Marker _bit 

This always has a value of “1.” It is present only 
if concealment_motion_vectors = “1” and [mac- 
roblock intra] = “1” in Tables 13.25, 13.26, and 
13.27. 


Coded_block_pattern_420 

This optional variable-length codeword is used 
to derive the 4:2:0 coded block pattern (CBP) 
as shown in Table 13.30. It is present only if 
[coded pattern] = “1” in Tables 13.25, 13.26, 
and 13.27, and indicates which blocks in the 
macroblock have at least one transform coeffi- 
cient transmitted. The coded block pattern 
number is represented as: 


PP2P3P4P5P¢6 


where P,, = “1” for any coefficient present for 
block [n], else P, = “0.” Block numbering is 
given in Figure 13.2. 


Coded_block_pattern_1 

Present only if chroma_format = 4:2:2 and 
[coded pattern] = “1” in Tables 13.25, 13.26, 
and 13.27, this optional 2-bit field is used to 
extend the coded block pattern by two bits. 


Coded_block_pattern_2 

Present only if chroma_format = 4:4:4 and 
[coded pattern] = “1” in Tables 13.25, 13.26, 
and 13.27, this optional 6-bit field is used to 
extend the coded block pattern by six bits. 


Block Layer 


Data for each block layer consists of coefficient 
data. The structure is shown in Figure 13.5. 


Dct_dc_size_luminance 

This optional variable-length code is present 
only for Y intra-coded blocks, and specifies the 
number of bits in the _ following 
dct_dc_differential. The values are shown in 
Table 13.31. 


Dect_dc_differential 
If dct_dc_size_luminance # 0, this optional vari- 
able-length code is present. The values are 
shown in Table 13.33. 


Dct_dc_size_chrominance 

This optional variable-length code is present 
only for CbCr intra-coded blocks, and specifies 
the number of bits in the following 
dct_dc_differential. The values are shown in 
Table 13.32. 


Dect_dc_differential 
If dct_dc_size_chrominance # “0,” this optional 
variable-length code is present. The values are 
shown in Table 13.33. 


Det_coefficient_first 

This optional variable-length codeword is used 
for the first DCT coefficient in non-intra-coded 
blocks, and is defined in Tables 13.34 and 
13.35. 
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Dct_coefficient_next 

Up to 63 optional variable-length codewords 
present only for I, P and B frames. They are 
the DCT coefficients after the first one, and are 
defined in Tables 13.34 and 13.35. 


End_of_block 

This 2-bit or 4-bit value is used to indicate that 
no additional non-zero coefficients are present. 
The value of this parameter is “10” or “0110.” 


Motion Compensation 


Figure 13.14 illustrates the basic motion com- 
pensation process. Motion compensation 
forms predictions from previously decoded 
pictures, which are in turn combined with the 
coefficient data (error terms) from the IDCT: 


Field Prediction 


Prediction for P pictures is made from the two 
most recently decoded reference fields. The 
simplest case is shown in Figure 13.15, used 


when predicting the first picture of a frame or 
when using field prediction within a frame. 

Predicting the second field of a frame also 
requires the two most recently decoded refer- 
ence fields. This is shown in Figure 13.16 
where the second picture is the bottom field 
and in Figure 13.17 where the second picture 
is the top field. 

Field prediction for B pictures is made 
from the two fields of the two most recent ref- 
erence frames, as shown in Figure 13.18. 


Frame Prediction 


Prediction for P pictures is made from the 
most recently decoded picture, as shown in 
Figure 13.19. The reference picture may have 
been coded as either two fields or a single 
frame. 

Frame prediction for B pictures is made 
from the two most recent reference frames, as 
shown in Figure 13.20. Each reference frame 
may have been coded as either two fields or a 
single frame. 


Motion Compensation 
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CBP Code CBP Code 
60 111 62 0100 0 
4 1101 24 0011 11 
8 1100 36 0011 10 
16 1011 3 0011 01 
32 1010 63 0011 00 
12 1001 1 5 0010 111 
48 1001 0 9 0010 110 
20 1000 1 17 0010 101 
40 1000 0 33 0010 100 
28 0111 1 6 0010 011 
44 0111 0 10 0010 010 
52 0110 1 18 0010 001 
56 0110 0 34 0010 000 
1 0101 1 7 0001 1111 
61 0101 0 11 0001 1110 
2 0100 1 19 0001 1101 
35 0001 1100 38 0000 1100 
13 0001 1011 29 0000 1011 
49 0001 1010 45 0000 1010 
21 0001 1001 53 0000 1001 
41 0001 1000 57 0000 1000 
14 0001 0111 30 0000 0111 
50 0001 0110 46 0000 0110 
22 0001 0101 54 0000 0101 
42 0001 0100 58 0000 0100 
15 0001 0011 31 0000 0011 1 
51 0001 0010 47 0000 0011 0 
23 0001 0001 55 0000 0010 1 
43 0001 0000 59 0000 0010 0 
25 0000 1111 27 0000 0001 1 
37 0000 1110 39 0000 0001 0 
26 0000 1101 o* 0000 0000 1 


*Not to be used with 4:2:0 chroma structure. 


Table 13.30. MPEG 2 Variable-Length Code Table for coded_block_pattern_420. 
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DCT DC DCT DC 
Size Code Size Code 
Luminance Luminance 

0) 100 6 11110 
1 00 7 1111 10 
2 01 8 1111 110 
3 101 9 1111 1110 
4 110 10 111111110 
5 1110 11 111111111 


Table 13.31. MPEG 2 Variable-Length Code Table for dct_dc_size_luminance. 


DCT DC DCT DC 
Size Code Size Code 
Chrominance Chrominance 

0) 00 6 1111 10 

1 01 7 1111110 
2 10 8 1111 1110 
3 110 9 1111 11110 
4 1110 10 1111 111110 
5 11110 11 1111 111111 


Table 13.32. MPEG 2 Variable-Length Code Table for dct_dc_size_chrominance. 
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DCT DC DCT DC Code Code Additional 
Differential Size (Y) (CbCr) Code 
-2048 to -1024 11 111111111 1111111111 00000000000 to 01111111111 
-1023 to -512 10 111111110 1111111110 0000000000 to 0111111111 
-511 to -256 9 11111110 111111110 000000000 to 011111111 
-255 to -128 8 1111110 11111110 00000000 to 01111111 
-127 to -64 7 111110 1111110 0000000 to 0111111 
-63 to -32 6 11110 111110 000000 to 011111 
-31 to -16 5 1110 11110 00000 to 01111 
-15 to -8 4 110 1110 0000 to 0111 
-7 to -4 3 101 110 000 to 011 
-3 to -2 2 01 10 00 to 01 
-1 1 00 01 0 
0 0 100 00 
1 1 00 01 1 
2to3 2 01 10 10 to 11 
4to7 3 101 110 100 to 111 
8 to 15 4 110 1110 1000 to 1111 
16 to 31 5 1110 11110 10000 to 11111 
32 to 63 6 11110 111110 100000 to 111111 
64 to 127 7 111110 1111110 1000000 to 1111111 
128 to 255 8 1111110 11111110 10000000 to 11111111 
256 to 511 9 11111110 111111110 100000000 to 111111111 
512 to 1023 10 111111110 1111111110 1000000000 to 1111111111 
1024 to 2047 11 111111111 1111111111 10000000000 to 11111111111 
Table 13.33. MPEG 2 Variable-Length Code Table for dct_dc_differential. 
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Run Level Code 
EOB 10 
(0) 1 1s if first coefficient 
0 1 lls not first coefficient 
0 2 0100 s 
0 3 0010 Is 
10) 4 0000 110s 
0 5 0010 0110 s 
0 6 0010 0001 s 
0 7 0000 0010 10s 
0 8 0000 0001 1101 s 
0 9 0000 0001 1000 s 
0 10 0000 0001 0011 s 
0 11 0000 0001 0000 s 
10) 12 0000 0000 1101 0s 
0 13 0000 0000 1100 Is 
0 14 0000 0000 1100 0s 
0 15 0000 0000 1011 Is 
0 16 0000 0000 0111 lls 
0 17 0000 0000 0111 10s 
0 18 0000 0000 0111 Ols 
0 19 0000 0000 0111 00s 
0) 20 0000 0000 0110 lls 
0 21 0000 0000 0110 10s 
0 22 0000 0000 0110 Ols 
0 23 0000 0000 0110 00s 
0) 24 0000 0000 0101 lls 
0 25 0000 0000 0101 10s 
0 26 0000 0000 0101 Ols 
0 27 0000 0000 0101 00s 
0 28 0000 0000 0100 lls 
(0) 29 0000 0000 0100 10s 
0 30 0000 0000 0100 Ols 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.34a. MPEG 2 Variable-Length Code Table Zero for 
dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 

0 31 0000 0000 0100 00s 

0 32 0000 0000 0011 000s 

0 33 0000 0000 0010 111s 

0 34 0000 0000 0010 110s 

0 35 0000 0000 0010 101s 

0 36 0000 0000 0010 100s 

0 37 0000 0000 0010 011s 

0 38 0000 0000 0010 010s 

0 39 0000 0000 0010 001s 

0 40 0000 0000 0010 000s 

1 1 011s 

1 2 0001 10s 

1 3 0010 0101 s 

1 4 0000 0011 00s 

1 5 0000 0001 1011 s 

1 6 0000 0000 1011 0s 

1 7 0000 0000 1010 Is 

1 8 0000 0000 0011 111s 

1 9 0000 0000 0011 110s 

1 10 0000 0000 0011 101s 

1 11 0000 0000 0011 100s 

1 12 0000 0000 0011 011s 

1 13 0000 0000 0011 010s 

1 14 0000 0000 0011 001s 

1 15 0000 0000 0001 0011s 

1 16 0000 0000 0001 0010s 

1 17 0000 0000 0001 0001s 

1 18 0000 0000 0001 000s 

2 1 0101 s 

2 2 0000 100s 

2 3 0000 0010 lls 

2 4 0000 0001 0100 s 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.34b. MPEG 2 Variable-Length Code Table Zero for 


dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 
2 5 0000 0000 1010 Os 
3 1 0011 Is 
3 2 0010 0100 s 
3 3 0000 0001 1100 s 
3 4 0000 0000 1001 Is 
4 1 0011 Os 
4 2 0000 0011 11s 
4 3 0000 0001 0010 s 
5 1 0001 lls 
5 2 0000 0010 O1ls 
5 3 0000 0000 1001 Os 
6 1 0001 Ols 
6 2 0000 0001 1110 s 
6 3 0000 0000 0001 0100s 
7 1 0001 00s 
7 2 0000 0001 0101 s 
8 1 0000 111s 
8 2 0000 0001 0001 s 
9 1 0000 101s 
9 2 0000 0000 1000 1s 
10 1 0010 0111 s 
10 2 0000 0000 1000 Os 
11 1 0010 0011 s 
11 2 0000 0000 0001 1010s 
12 1 0010 0010 s 
12 2 0000 0000 0001 1001s 
13 1 0010 0000 s 
13 2 0000 0000 0001 1000s 
14 1 0000 0011 10s 
14 2 0000 0000 0001 0111s 
15 1 0000 0011 Ols 
15 2 0000 0000 0001 0110s 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.34c. MPEG 2 Variable-Length Code Table Zero for 
dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 

16 1 0000 0010 00s 

16 2 0000 0000 0001 0101s 
17 1 0000 0001 1111 s 

18 1 0000 0001 1010 s 

19 1 0000 0001 1001 s 

20 1 0000 0001 0111 s 

21 1 0000 0001 0110 s 

22 1 0000 0000 1111 Is 
23 1 0000 0000 1111 Os 
24 1 0000 0000 1110 Is 
25 1 0000 0000 1110 Os 
26 1 0000 0000 1101 1s 
27 1 0000 0000 0001 1111s 
28 1 0000 0000 0001 1110s 
29 1 0000 0000 0001 1101s 
30 1 0000 0000 0001 1100s 
31 1 0000 0000 0001 1011s 

ESC 0000 01 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.34d. MPEG 2 Variable-Length Code Table Zero for 


dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 

EOB 0110 

0 1 10s 

0 2 110s 

0 3 0111 s 

0 4 1110 Os 

0 5 1110 1s 

0 6 0001 O1ls 

0 vi 0001 00s 

0 8 1111 011s 

0 9 1111 100s 

0 10 0010 0011 s 

0 11 0010 0010 s 

0 12 1111 1010 s 

0 13 1111 1011 s 

0 14 1111 1110 s 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.35a. MPEG 2 Variable-Length Code Table One for 
dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 
0 15 1111 1111 s 
0 16 0000 0000 0111 11s 
0 17 0000 0000 0111 10s 
0 18 0000 0000 0111 Ols 
0 19 0000 0000 0111 00s 
0 20 0000 0000 0110 lls 
0 21 0000 0000 0110 10s 
0 22 0000 0000 0110 01s 
0 23 0000 0000 0110 00s 
0 24 0000 0000 0101 lls 
0 25 0000 0000 0101 10s 
0 26 0000 0000 0101 Ols 
(0) 27 0000 0000 0101 00s 
0 28 0000 0000 0100 lls 
0 29 0000 0000 0100 10s 
0 30 0000 0000 0100 Ols 
(0) 31 0000 0000 0100 00s 
0 32 0000 0000 0011 000s 
0 33 0000 0000 0010 111s 
0 34 0000 0000 0010 110s 
0 35 0000 0000 0010 101s 
0 36 0000 0000 0010 100s 
0 37 0000 0000 0010 011s 
0 38 0000 0000 0010 010s 
0 39 0000 0000 0010 001s 
0 40 0000 0000 0010 000s 
1 1 010s 
1 2 0011 Os 
1 3 1111 001s 
1 4 0010 0111 s 
1 5 0010 0000 s 
1 6 0000 0000 1011 Os 

Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.35b. MPEG 2 Variable-Length Code Table One for 


dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 

1 7 0000 0000 1010 1s 
1 8 0000 0000 0011 111s 
1 9 0000 0000 0011 110s 
1 10 0000 0000 0011 101s 
1 11 0000 0000 0011 100s 
1 12 0000 0000 0011 011s 
1 13 0000 0000 0011 010s 
1 14 0000 0000 0011 001s 
1 15 0000 0000 0001 0011s 
1 16 0000 0000 0001 0010s 
1 17 0000 0000 0001 0001s 
1 18 0000 0000 0001 0000s 
2 1 0010 Is 
2 2 0000 111s 
2 3 1111 1100 s 
2 4 0000 0011 00s 

2 5 0000 0000 1010 0s 

3 1 0011 1s 

3 2 0010 0110 s 

3 3 0000 0001 1100 s 

3 4 0000 0000 1001 Is 

4 1 0001 10s 

4 2 1111 1101 s 

4 3 0000 0001 0010 s 

5 1 0001 11s 

5 2 0000 0010 Os 

5 3 0000 0000 1001 0s 

6 1 0000 110s 

6 2 0000 0001 1110 s 

6 3 0000 0000 0001 0100s 

7 1 0000 100s 

7 2 0000 0001 0101 s 


Notes: 
1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.35c. MPEG 2 Variable-Length Code Table One for 
dct_coefficient_first and dct_coefficient_next. 
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Run Level Code 

8 1 0000 101s 

8 2 0000 0001 0001 s 

9 1 1111 000s 

9 2 0000 0000 1000 Is 

10 1 1111 010s 

10 2 0000 0000 1000 Os 

11 1 0010 0001 s 

11 2 0000 0000 0001 1010s 
12 1 0010 0101 s 

12 2 0000 0000 0001 1001s 
13 1 0010 0100 s 

13 2 0000 0000 0001 1000s 
14 1 0000 0010 Is 

14 2 0000 0000 0001 0111s 
15 1 0000 0011 Is 

15 2 0000 0000 0001 0110s 
16 1 0000 0011 Ols 

16 2 0000 0000 0001 0101s 
17 1 0000 0001 1111 s 

18 1 0000 0001 1010 s 

19 1 0000 0001 1001 s 

20 1 0000 0001 0111 s 

21 1 0000 0001 0110 s 

22 1 0000 0000 1111 Is 
23 1 0000 0000 1111 Os 
24 1 0000 0000 1110 Is 
25 1 0000 0000 1110 Os 
26 1 0000 0000 1101 1s 
27 1 0000 0000 0001 1111s 
28 1 0000 0000 0001 1110s 
29 1 0000 0000 0001 1101s 
30 1 0000 0000 0001 1100s 
31 1 0000 0000 0001 1011s 

ESC 0000 01 
Notes: 


1. s = sign of level; “0” for positive; s = “1” for negative. 


Table 13.35d. MPEG 2 Variable-Length Code Table One for 


dct_coefficient_first and dct_coefficient_next. 
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Run Length Code 
0 0000 00 
1 0000 01 
2 0000 10 
62 1111 10 
63 1111 11 


Table 13.36. Run Encoding Following An Escape Code 
for dct_coefficient_first and dct_coefficient_next. 


Level Code 
-2047 1000 0000 0001 
-2046 1000 0000 0010 
-1 1111 1111 1111 
forbidden 0000 0000 0000 
1 0000 0000 0001 
2047 0111 1111 1111 


Table 13.37. Level Encoding Following An Escape Code 
for dct_coefficient_first and dct_coefficient_next. 
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Figure 13.14. Simplified Motion Compensation Process. 
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Figure 13.15. P Picture Prediction of First Field or Field Prediction in a Frame Picture. 
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Figure 13.16. P Picture Prediction of Second Field Picture (Bottom Field). 
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Figure 13.17. P Picture Prediction of Second Field Picture (Top Field). 
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Figure 11.18. Field Prediction of B Field or Frame Pictures. 
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Figure 13.19. Frame Prediction For P Pictures. 


REFERENCE 
FRAME 


POSSIBLE 


INTERVENING 
B PICTURES 


ALREADY 
DECODED 


POSSIBLE 


INTERVENING 
B PICTURES 


NOT YET 
DECODED 


REFERENCE 
FRAME 


Figure 13.20. Frame Prediction For B Pictures. 
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Program Stream 


The program stream consists of one or more 
audio and/or video streams multiplexed 
together. Data from each audio and video 
stream is multiplexed and coded with data that 
allows them to be decoded in synchronization. 

Data from audio and video streams is 
stored in PES (Packetized Elementary 
Stream) packets. PES packets are then orga- 
nized in packs. The general format of the pro- 
gram stream is shown in Figure 13.21. 


Pack Layer 


Data for each pack consists of a pack header 
followed by an optional system header and one 
or more PES packets. 


Pack_start_code 
This 32-bit string has a value of 000001BAy 
and indicates the beginning of a pack. 


Marker _bits 
These 2 bits have a value of “01.” 


System_clock_reference_base [32-30] 


Marker _bit 
This bit has a value of “1.” 


System_clock_reference_base [29-15] 


Marker_bit 
This bit has a value of “1.” 


System_clock_reference_base [14-0] 


Marker _bit 
This bit has a value of “1.” 


System_clock_reference_extension 

This 9-bit field, along with the 
system_clock_reference_base field, comprises 
the system clock reference (SCR). The 
system_clock_reference_base field is specified in 
units of 1/300 multiplied by 90 kHz. The 
system_clock_reference_extension is specified in 
units of 27 MHz. SCR indicates the intended 
time of arrival of the byte containing the last 
bit of the system_clock_reference_base field at 
the input of the decoder. 


Marker _bit 
This bit has a value of “1.” 


Program_mux_rate 

This 22-bit binary number specifies a value 
measured in 50 bytes per second, with a value 
of zero forbidden. It indicates the rate at which 
the decoder receives the program stream. 


Marker _bit 
This bit has a value of “1.” 


Marker _bit 
This bit has a value of “1.” 


Reserved 
These five bits have a value of “00000.” 


Pack_stuffing_length 
This 3-bit binary number specifies the number 
of stuffing_byte fields following this field. 


Stuffing_byte 
0-7 stuffing bytes may be present. They are 
ignored by the decoder. Each byte has a value 
of “1111 1111.” 
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Figure 13.21. MPEG 2 Program Stream Structure. Marker and reserved bits not shown. 


System Header 


This field contains a summary of the bitstream 
parameters. There must be one following the 
first pack header, and then it may be optionally 
repeated in future pack headers. 


System_header_start_code 

This 32-bit string has a value of 000001BBy 
and indicates the beginning of a system 
header. 


Header_length 

This 16-bit binary number specifies the num- 
ber of bytes of the system header following 
this field. 


Marker _bit 
This bit has a value of “1.” 


Rate_bound 

This 22-bit binary number specifies a value 
greater than or equal to the maximum value of 
program_mux_rate. 


Marker_bit 
This bit has a value of “1.” 


Audio_bound 

This 6-bit binary number specifies a value (0 to 
32) that is greater than or equal to the maxi- 
mum number of active audio streams. 


Fixed_flag 
A “1” for this bit indicates fixed bit rate opera- 
tion. A “0” indicates variable bit rate operation. 


CSPS_flag 
This bit is a “1” ifthe program stream is a “con- 
strained system parameters stream” (CSPS). 


System _audio_lock_flag 

This bit is a “1” if there is a specified, constant 
relationship between the audio sampling rate 
and the decoder system clock frequency. 


System_video_lock_flag 

This bit is a “1” if there is a specified, constant 
relationship between the video picture rate and 
the decoder system clock frequency. 
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Program Stream 


Marker _bit 
This bit has a value of “1.” 


Video_bound 

This 5-bit binary number specifies a value (0 to 
16) that is greater than or equal to the maxi- 
mum number of active video streams. 


Reser ved_bits 
These seven bits have a value of “111 1111.” 


Stream_ID 

This optional 8-bit code shown in Table 13.40 
indicates the stream to which the P- 
STD_buffer_bound_scale and P- 
STD_buffer_size_bound fields apply. 


Marker_bits 
These optional two bits have a value of “11.” 
They are present only if stream_ID is present. 


P-STD_buffer_bound_scale 

This optional bit is present only when 
stream_ID is present and indicates the scaling 
factor used for P_STD_buffer_size_bound. A “0” 
indicates the stream_ID specifies an audio 
stream. A “1” indicates the stream_ID specifies 
a video stream. For other types of stream IDs, 
the value may be either a “0” or a “1.” 


P-STD_buffer_size_bound 

This optional 13-bit binary number specifies a 
value greater than or equal to the maximum 
decoder input buffer size. It is present only 
when stream_ID is present. If 
P_STD_buffer_bound_scale is a “0,” the unit is 
128 bytes. If P_STD_buffer_bound_scale is a 
“1,” the unit is 1024 bytes. 


Program Stream Map 


The program stream map provides a descrip- 
tion of the bitstreams in the Program Stream, 
and their relationship to one another. It is 
present as PES packet data if stream_ID = pro- 
gram stream map. 


Packet_start_code_prefix 
This 24-bit string has a value of 000001y and 
indicates the beginning of a stream map. 


Map_stream_ID 
This 8-bit string has a value of “1011 1100.” 


Program_stream_map_length 

This 16-bit binary number indicates the num- 
ber of bytes following this field. It has a maxi- 
mum value of 1018. 


Current_next_indicator 

A “1” for this bit indicates that the program 
stream map is currently applicable. A “0” indi- 
cates the program stream map is not applicable 
yet and will be the next one valid. 


Reserved 
These two bits have a value of “00.” 


Program_stream_map_version 

This 5-bit binary number specifies the version 
number of the program stream map. It must be 
incremented by one when the program stream 
map changes, wrapping around to zero after 
reaching a value of 31. 


Reserved 
These 7 bits have a value of “000 0000.” 


Marker_bit 
This bit always has a value of “1.” 
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Program_stream_info_length 

This 16-bit binary number specifies the total 
length of the descriptors immediately follow- 
ing this field. 


Descriptors 
This field is present for each descriptor sent. 


Elementary_stream_map_length 

This 16-bit binary number indicates the num- 
ber of bytes of all elementary stream informa- 
tion in this program stream map. 


Note: The following four fields are present for 
each descriptor sent. 


Stream_type 
This 8&bit codeword specifies the type of 
stream as shown in Table 13.38. 


Elementary_stream_ID 

This 8-bit field specifies the value of stream_ID 
in the PES packet header of PES packets con- 
taining this bitstream. 


Elementary_stream_info_length 

This 16-bit binary number specifies the total 
length of the descriptors immediately follow- 
ing this field. 


Descriptors 
This field is present for each descriptor sent. 


CRC_32 
This 32-bit CRC for the entire program stream 
map. 


Stream Type Code Stream Type Code 
reserved 0000 0000 H.222.1 0000 1001 
MPEG 1 video 0000 0001 ISO/IEC 13818-6 type A 0000 1010 
MPEG 2 video 0000 0010 ISO/IEC 13818-6 type B 0000 1011 
MPEG 1 audio 0000 0011 ISO/IEC 13818-6 type C 0000 1100 
MPEG 2 audio 0000 0100 ISO/IEC 13818-6 type D 0000 1101 
private sections 0000 0101 ISO/IEC 13818 auxiliary 0000 1110 
private data in PES packets 0000 0110 reserved 0000 1111 - 0111 1111 
MHEG 0000 0111 user private 1000 1111 - 1111 1111 
DSM CC 0000 1000 


Table 13.38. stream_type Codewords. 
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Transport Stream 


The transport stream consists of one or more 
188-byte packets. The data for each packet is 
from PES packets, PSI (Program Specific 
Information) sections, stuffing bytes, or private 
data. 

The general format of the transport stream 
is shown in Figure 13.22. 


Packet Layer 


Data for each packet consists of a packet 
header followed by an optional adaptation field 
and/or one or more data packets. 


Sync_byte 
This 8-bit string has a value of “0100 0111.” 


Transport_error_indicator 

A “1” for this bit indicates that at least one 
uncorrectable bit error is present in the 
packet. 


Payload_unit_start_indicator 
The meaning of this bit is dependent on the 
payload. 

For PES packet data, a “1” indicates the 
data block in this packet starts with the first 
byte of a PES packet. A “0” indicates no PES 
packet starts in the data block of this packet. 

For PSI data, a “1” indicates the data block 
of this packet contains the first byte of a PSI 
section. 


Transport_priority 

A “1” for this bit indicates that this packet is of 
higher priority than other packets having the 
same PID. 


PID 
This 13-bit codeword indicates the type of data 
in the data block, as shown in Table 13.39. 


Description Code 
program association table | 0 0000 0000 0000 
conditional access table 0 0000 0000 0001 

0 0000 0000 0010 
reserved through 

0 0000 0000 1111 
may be assigned as 
network_PID, 0 0000 0001 0000 
program_map_PID, through 
elementary_PID, 11111 1111 1110 
or for other purposes 
program stream directory | 1111111111111 


Table 13.39. MPEG 2 PID Codewords. 


Transport_scrambling_control 

This 2-bit code indicates the scrambling mode 
of the payload. “00” = not scrambled, “01” = 
reserved, “10” = scrambled with even key, “11” 
= scrambled with odd key. 


Adaptation_field_control 

This 2-bit code indicates whether this packet 
header is followed by an adaptation field or 
data block, as shown in Table 13.40. 


Description Code 
reserved 00 
data only 01 
adaptation field only 10 
adaptation field followed by rl 
data 


Table 13.40. MPEG 2 adaptation_field_control 
Codewords. 
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Figure 13.22. MPEG 2 Transport Stream Structure. Marker and reserved bits not shown. 


Continuity_counter 

This 4-bit binary number increments with each 
packet with the same PID. After reaching the 
maximum value, it wraps around. It does not 
increment when no data block is present in the 
packet. 


Data_byte 

These [n] data bytes are contiguous bytes of 
data from PES packets, PSI sections, stuffing 
bytes, or private data. 


PES Packet 


Data for each PES packet consists of a 32-bit 
start code that also identifies the stream to 
which the data belongs. Time stamps may be 
present along with other optional fields. The 
packet data contains a variable number of 
bytes from one elementary bitstream. 

The general format of the PES packet is 
shown in Figure 13.23. 


Packet_start_code_prefix 

This 24-bit field has a value of 000001, and in 
conjunction with stream_ID, indicates the 
beginning of a packet. 


Stream_ID 

This 8-bit code specifies the type and number 
of elementary streams, as shown in Table 
13.41. 


PES_packet_length 

This 16-bit binary number specifies the num- 
ber of bytes in the PES packet following this 
field. A value of zero indicates it is neither 
specified nor bounded, and is used only in 
transport streams. For the ATSC standard, the 
value must be 0000q. 


PES Packet 
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Figure 13.23. MPEG 2 PES Packet Structure. Marker and reserved bits not shown. 
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Stream Code 
all audio streams 1011 1000 
all video streams 1011 1001 
program stream map 1011 1100 
private stream 1 1011 1101 
padding stream 1011 1110 
private stream 2 1011 1111 
audio stream [xxxxx] 110x xxxx 
video stream [xxxx] 1110 xxxx 
ECM stream 1111 0000 
EMM stream 1111 0001 
DSM_CC stream 1111 0010 
ISO/IEC 13552 stream 1111 0011 
H.222.1 type A 1111 0100 
H.222.1 type B 1111 0101 
H.222.1 type C 1111 0110 
H.222.1 type D 1111 0111 
H.222.1 type E 1111 1000 
ancillary_stream 1111 1001 
reserved 1111 1010 
reserved 1111 1110 
program stream directory 1111 1111 


Table 13.41. MPEG 2 stream_ID 
Codewords. 


Note: The following fields (until the next note) 
are not present if stream_ID = program stream 
map, padding stream, private stream 2, ECM 
stream, EMM stream, DSM_CC stream, 
H.222.1 type E, or program stream directory. 


Marker_bits 
These optional two bits have a value of “10.” 


PES _scrambling_control 

This optional 2-bit code specifies the scram- 
bling mode. “00” = not scrambled, “01” = 
reserved, “10” = scrambled with even key, “11” 
= scrambled with odd key. For the ATSC stan- 
dard, the value must be “00.” 


PES priority 

This optional bit specifies the priority of the 
payload of the PES packet. A “1” has a higher 
priority than a “0.” 


Data_alignment_indicator 

A “1” for this optional bit indicates that the PES 
packet header is immediately followed by the 
video start code or audio syncword specified 
by the Data Stream Alignment Descriptor (if 
present). For the ATSC standard, the value 
must be “1.” 


Copyright 
A “1” for this bit indicates that the material is 
copyrighted. 


Original_or_copy 
A “1” for this bit indicates that the material is 
original. A “0” indicates it is a copy. 


PTS_DTS_flags 

A value of “10” for these two bits indicates a 
PTS (presentation time stamp) field is present 
in the PES packet header. A value of “11” indi- 
cates both a PTS and DTS (decoding time 
stamp) fields are present. A value of “00” indi- 
cates neither PTS or DTS fields are present. 


ESCR_flag 

A “1” for this bit indicates the ESCR (elemen- 
tary stream clock reference) base and exten- 
sion fields are present in the PES packet 
header. For the ATSC standard, the value must 
be “0.” 
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ES_rate_flag 

A“1” for this bit indicates ES_rate (elementary 
stream rate) is present in the PES packet 
header. For the ATSC standard, the value must 
be “0.” 


DSM_trick_mode_flag 
A “1” for this bit indicates 
trick_mode_control field is present. 


that the 


Additional_copy_info_flag 
A “1” for this bit indicates 
additional_copy_info field is present. 


that the 


PES_CRC_flag 

A “1” for this bit indicates that the 
previous_PES_packet_CRC field is present. For 
the ATSC standard, the value must be “0.” 


PES_extension_flag 
A “1” for this bit indicates that an extension 
field is present in this PES packet header. 


PES_header_data_length 

This 8-bit binary number specifies the number 
of bytes for optional fields and stuffing in this 
PES packet header. 


Marker_bits 

These optional four bits have a value of “0010.” 
This field is present only if PTS_DTS_flags = 
“10.” 


PTS [32-30] 
This optional field is 
PTS_DTS_flags = “10.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “10.” 


PTS [29-15] 
This optional field is 
PTS_DTS_flags = “10.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “10.” 


PTS [14-0] 

The optional 33-bit presentation time stamp 
(PTS) indicates the intended time of display by 
the decoder. It is specified in periods of the 27- 
MHz clock divided by 300. This field is present 
only if PTS_DTS_flags = “10.” 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “10.” 


Marker_bits 

These optional four bits have a value of “0011.” 
This field is present only if PTS_DTS_flags = 
“41” 


PTS [32-30] 
This optional field is 
PTS_DTS_flags = “11.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “11.” 


PTS [29-15] 
This optional field is 
PTS_DTS_flags = “11.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “11.” 
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PTS [14-0] 
This optional field is 
PTS_DTS_flags = “11.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “11.” 


Marker_bits 

These optional four bits have a value of “0001.” 
This field is present only if PTS_DTS_flags = 
“11.” 


DTS [32-30] 
This optional field is 
PTS_DTS_flags = “11.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “11.” 


DTS [29-15] 
This optional field is 
PTS_DTS_flags = “11.” 


present only if 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “11.” 


DTS [14-0] 

The optional 33-bit decoding time stamp (DTS) 
indicates the intended time of decoding. It is 
specified in periods of the 27-MHz clock 
divided by 300. This field is present only if 
PTS_DTS_flags = “11.” 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if PTS_DTS_flags = “11.” 


Reserved 
These optional two bits have a value of “00.” 
This field is present only if ESCR_flag = “1.” 


ESCR_base [32-30] 
This optional field is present only if ESCR_flag 
— ial ice 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if ESCR_flag = “1.” 


ESCR_base [29-15] 
This optional field is present only if ESCR_flag 
= se? 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if ESCR_flag = “1.” 


ESCR_base [14-0] 
This optional field is present only if ESCR_flag 
= bol es 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if ESCR_flag = “1.” 


ESCR_ extension 

The optional 9-bit elementary stream clock ref- 
erence (ESCR) extension and the 33-bit ESCR 
base are combined into a 42-bit value. It indi- 
cates the intended time of arrival of the byte 
containing the last bit of ESCR_base. The value 
of ESCR_base specifies the number of 90-kHz 
clock periods. 

The value of ESCR_extension specifies the 
number of 27-MHz clock periods after the 90 
kHz period starts. 

This field is present only if ESCR_flag = 
a Was 
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Marker_bit 
This optional bit has a value of “1.” This field is 
present only if ESCR_flag = “1.” 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if ES_rate_flag = “1.” 


ES _rate 

This optional 22-bit elementary stream rate 
(ES_rate) indicates the rate the decoder 
receives bytes of the PES packet. It is specified 
in units of 50 bytes per second. This field is 
present only if ES_rate_flag = “1.” 


Marker_bit 
This optional bit has a value of “1.” This field is 
present only if ES_rate_flag = “1.” 


Trick_mode_control 

This optional 3-bit codeword indicates which 
trick mode is applied to the video stream, as 
shown in Table 13.42. This field is present only 
if DSM_trick_mode_flag = “1.” 


Trick Mode Code 
fast forward 000 
slow motion 001 
freeze frame 010 
fast reverse 011 
slow reverse 100 
reserved 101 
reserved 110 
reserved 111 


Table 13.42. MPEG 2 trick_mode_control 
Codewords. 


Field_ID 

This optional 2-bit codeword indicates which 
fields are to be displayed, as shown in Table 
13.43. This field is present only if 
DSM_trick_mode_flag = 1” and 
trick_mode_control = “000” or “011.” 


Field ID Code 
top field only 00 
bottom field only 01 
complete frame 10 
reserved 11 


Table 13.43. MPEG 2 field_ID Codewords. 


Intra_slice_refresh 

A “1” for this optional bit indicates that there 
may be missing macroblocks between slices. 
This field is present only if 
DSM_trick_mode_flag = “1” and 
trick_mode_control = “000” or “011.” 


Frequency_truncation 

This optional 2-bit codeword indicates that a 
restricted set of coefficients may have been 
used in coding the data, as shown in Table 


13.44. This field is present only if 
DSM_trick_mode_flag = “1”, and 
trick_mode_control = “000” or “011.” 
Description Code 

only DC coefficients are non-zero 00 

first 3 coefficients are non-zero 01 

first 6 coefficients are non-zero 10 

all coefficients may be non-zero 11 


Table 13.44. MPEG 2 frequency_truncation 
Codewords. 
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Rep_cntrl 

This optional 5-bit binary number indicates the 
number of times each interlaced field or pro- 
gressive frame should be displayed. A value of 
“00000” is not allowed. This field is present 
only if DSM_trick_mode_flag = “1” and 
trick_mode_control = “O01” or “100.” 


Field_ID 

This optional 2-bit codeword shown in Table 
13.43 indicates which fields are to be dis- 
played. This field is present only if 
DSM_trick_mode_flag = “1” and 
trick_mode_control = “O10.” 


Reserved 

These three optional bits have a value of “000.” 
This field is present only if 
DSM_trick_mode_flag = “1” and 


trick_mode_control = “O10.” 


Reserved 

These five bits have a value of “00000.” This 
field is present only if DSM_trick_mode_flag = 
“1” and trick_mode_control = “101,” “110,” or 
“111.” 


Marker_bit 
This optional bit is always a “1.” This field is 
present only if additional_copy_info_flag = “1.” 


Additional_copy_info 

This optional 7-bit field contains private data 
regarding copyright information. This field is 
present only if additional_copy_info_flag = “1.” 


Previous_PES_packet_CRC 
These optional 16 bits are present only if 
PES_CRC_flag = “1.” 


PES _private_data_flag 

A “1” for this optional bit indicates that private 
data is present. This field is present only if 
PES_extension_flag = “1.” For the ATSC stan- 
dard, the value must be “0.” 


Pack_header_field_flag 

A “1” for this bit indicates that an MPEG 1 pack 
header or program stream pack header is in 
this PES packet header. This field is present 
only if PES_extension_flag = “1.” For the ATSC 
standard, the value must be “0.” 


Program_packet_sequence_counter_flag 
A “1” for this optional bit indicates that the 
program_packet_sequence_counter, 
MPEG1_MPEG2_identifier, and 
original_stuff_length fields are present. This 
field is present only if PES_extension_flag = “1.” 
For the ATSC standard, the value must be “0.” 


P-STD_buffer_flag 

A “1” for this optional bit indicates that P- 
STD_buffer_scale and P-STD_buffer_size are 
present. This field is present only if 
PES_extension_flag = “1.” For the ATSC stan- 
dard, the value must be “0.” 


Reserved 

These optional three bits are always “000.” 
This field is present only if PES_extension_flag 
= 1’ 


PES _extension_flag_2 
A “1” for this optional bit indicates that 
PES_extension_field_length and associated 
fields are present. This field is present only if 
PES_extension_flag = “1.” 
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PES_private_data 

These optional 128 bits of private data, com- 
bined with the fields before and after, must not 
emulate the packet_start_code_prefix. This field 
is present only if PES_extension_flag = “1” and 
PES_private_data_flag = “1.” 


Pack_field_length 

This optional 8-bit binary number indicates the 
length, in bytes, of an immediately following 
pack header. This field, and the immediately 
following pack header, are present only if 
PES_extension_flag = “1” and 
pack_header_field_flag = “1.” 


Marker_bit 

This optional bit is always a “1.” This field is 
present only if PES_extension_flag = “1” and 
program_packet_sequence_counter_flag = “1.” 


Program_packet_sequence_counter 

This optional 7-bit binary number increments 
with each successive PES packet in a program 
stream or MPEG 1 system stream. It wraps 
around to zero after reaching its maximum 
value. No two consecutive PES packets can 
have the same values. This field is present only 
if PES_extension_flag = is and 
program_packet_sequence_counter_flag = “1.” 


Marker_bit 

This optional bit is always a “1.” This field is 
present only if PES_extension_flag = “1” and 
program_packet_sequence_counter_flag = “1.” 


MPEG1_MPEG2_ identifier 

A “1” for this optional bit indicates the PES 
packet has information from a MPEG 1 system 
stream. A “OQ” indicates the PES packet has 
information from a program stream. This field 
is present only if PES_extension_flag = “1” and 
program_packet_sequence_counter_flag = “1.” 


Original_stuff_length 

This optional 6-bit binary number specifies the 
number of stuffing bytes used in the original 
PES or MPEG 1 packet header. This field is 
present only if PES_extension_flag = “1” and 
program_packet_sequence_counter_flag = “1.” 


Marker_bits 

These optional two bits are always “01.” This 
field is present only if PES_extension_flag = “1” 
and P-STD_buffer_flag = “1.” 


P-STD_buffer_scale 

This optional bit indicates the scaling factor for 
the following P-STD_buffer_size parameter. For 
audio streams, a value of “0” is present. For 
video streams, a value of “1” is present. For all 
other types of streams, a value of “0” or “1” 
may be used. This field is present only if 
PES_extension_flag = “\” and P- 
STD_buffer_flag = “1.” 


P-STD_buffer_size 

This optional 13-bit binary number specifies 
the size of the decoder input buffer. If P- 
STD_buffer_scale is a “0,” the unit is 128 bytes. 
If P-STD_buffer_scale is a “1,” the unit is 1024 
bytes. This field is present only if 
PES_extension_flag = a bag and P- 
STD_buffer_flag = “1.” 


Marker_bit 

This optional bit is always a “1.” This field is 
present only if PES_extension_flag = “1” and 
PES_extension_flag_2 = “1.” 


PES_extension_field_length 

An optional 7-bit binary number that indicates 
the number of bytes of the next field. This field 
is present only if PES_extension_flag = “1” and 
PES_extension_flag_2 = “1.” 
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PES_extension_field_data 

This optional field of n bytes is present only if 
PES_extension_flag = a ka and 
PES_extension_flag_2 = “1.” 


Stuffing_byte 

Each byte always has a value of “1111 1111.” 
Up to 32 stuffing bytes may be used. They are 
ignored by the decoder. 


PES_packet_data_byte 

[n] bytes of data from the audio stream, video 
stream, private stream 1, ancillary stream, 
H.222.1 types A-D stream, or ISO/IEC 13552 
stream. The number of bytes is derived from 
the PES_packet_length field. 


Note: The following field (until the next note) is 
present if stream_ID = program stream map, 
private stream 2, ECM stream, EMM stream, 
DSM_CC stream, H.222.1 type E, or program 
stream directory. 


PES_packet_data_byte 

[n] bytes of data from program stream map, 
private stream 2, ECM stream, EMM stream, 
DSM_CC stream, H.222.1 type E stream, or 
program stream directory descriptors. The 
number of bytes is derived from the 
PES_packet_length field. 


Note: The following field is present if stream_ID 
= padding stream. 


Padding_byte 

[n] bytes that have a value of “1111 1111.” The 
number of bytes is _ specified by 
PES_packet_length. It is ignored by the 
decoder. 


Descriptors 


Descriptors may be used to extend the defini- 
tions of various streams, and are contained 
within the program stream map. Their general 
format is: 


descriptor_tag (8 bits) 
descriptor_length (8 bits) 
data 


Descriptor tag values of 0, 1, and 16-63 are 
reserved. Descriptor tag values of 64-255 may 
be used for private user data. 


Data Stream Alignment Descriptor 


This descriptor provides a method of identify- 
ing the type of alignment in the stream. It must 
be present in an ATSC bitstream. 


Descriptor_tag 
This 8-bit binary number has a value of “0000 
0110.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Alignment_type 

This 8-bit codeword specifies the audio or 
video alignment type as shown in Table 13.45. 
For the ATSC standard, the value must be 
“0000 0010.” 


Copyright Descriptor 


This descriptor provides a method of identifica- 
tion of works of art. 


Descriptor_tag 
This 8-bit binary number has a value of “0000 
1101.” 
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Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Copyright_ID 
This 32-bit value is obtained from the Registra- 
tion Authority. 


Additional_copyright_info 
These optional [n] bytes of data are defined by 
the copyright owner and are never changed. 


Registration Descriptor 


This descriptor provides a method of identify- 
ing formats of private data. 


Descriptor_tag 
This 8-bit field has a value of “0000 0101.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Format_identifier 
This 32-bit value is obtained from the Registra- 
tion Authority. 


Additional_identification_info 
These optional [n] bytes of data are defined by 
the registration owner and are never changed. 


Target Background Grid Descriptor 


This descriptor provides displaying the video 
within a specified location of the display. It is 
useful when the video is not intended to use 
the full area of the display. 


Descriptor_tag 
This 8-bit field has a value of “0000 0111.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Video Stream Audio Stream Code 
Alignment Type Alignment Type 
reserved reserved 0000 0000 
slice, picture, GOP, or sequence sync word 0000 0001 
picture, GOP, or sequence 0000 0010 
GOP or sequence 0000 0011 
sequence 0000 0100 
reserved 
0000 0101 
reserved : 
1111 1111 


Table 13.45. alignment_type Codewords. 
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Horizontal_size 

This 14-bit binary number specifies the hori- 
zontal size of the target background grid in 
samples. 


Ver tical_size 
This 14-bit binary number specifies the verti- 
cal size of the target background grid in lines. 


Aspect_ratio_information 
This 4-bit codeword specifies the aspect ratio 
as defined in the video sequence header. 


Language Descriptor 


This descriptor provides a method to indicate 
the language(s) of the stream. 


Descriptor_tag 
This 8-bit field has a value of “0000 1010.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Note: The following two fields are present for 
each language. 


ISO_639_language_code 
This 24-bit field contains a 3-character lan- 
guage code. 


Audio_type 
This 8-bit codeword identifies the audio type: 


00y = reserved 

01y = clean effects (no language) 
02y = hearing impaired 

03y = visual impaired commentary 
04y-FFy = reserved 


System Clock Descriptor 


This descriptor provides a method to indicate 
information about the system clock used to 
generate timestamps. 


Descriptor_tag 
This 8-bit field has a value of “0000 1011.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


External_clock_reference_indicator 

A “1” for this bit indicates that the system clock 
was derived from a reference that may be avail- 
able at the decoder. 


Reserved 
This bit is always a “0.” 


Clock_accuracy_integer 

Combined with clock_accuracy_exponent, this 
6-bit binary number provides the fractional fre- 
quency accuracy of the system clock. 


Clock_accuracy_exponent 

Combined with clock_accuracy_integer, this 3- 
bit binary number provides the fractional fre- 
quency accuracy of the system clock. 


Reserved 
These 5 bits are always “00000.” 


Multiplex Buffer Utilization Descriptor 
This descriptor provides bounds on the occu- 
pancy of the STD multiplex buffer. 


Descriptor_tag 
This 8-bit field has a value of “0000 1100.” 
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Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Bound_valid_flag 

A li for this bit indicates 
LTW_offset_lower_bound and 
LTW_offset_upper_bound are valid. 


LTW_offset_lower_bound 

This 15-bit binary number is in units of (27 
MHz / 300) clock periods. It specifies the low- 
est value any LTW_offset field will have. 


Reserved 
This bit is always a “0.” 


LTW_offset_upper_bound 

This 15-bit binary number is in units of (27 
MHz / 300) clock periods. It specifies the 
upper value any LTW_offset field will have. 


Private Data Descriptor 


Descriptor_tag 
This 8-bit field has a value of “0000 1111.” 


Descriptor_length 

This 8-bit binary number specifies the number 
of bytes following this field. It will have a value 
of “0000 0100.” 


Private_data_indicator 
These 32 bits are private data and are not 
defined by the MPEG 2 specification. 


Video Stream Descriptor 


This descriptor provides basic information 
about the video stream. 


Descriptor_tag 
This 8-bit field has a value of “0000 0010.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Multiple_frame_rate_flag 
A “1” for this bit indicates multiple frame rates 
may be present in the video stream. 


Frame_rate_code 

This 4-bit codeword indicates the video frame 
rate, as shown in Table 13.46. When 
multiple_frame_rate_flag is a “1,” the indication 
of a specific frame rate also allows other frame 
rates to be present in the video stream. 


MPEG _1_only_flag 
A “1” for this bit indicates the video stream 
contains only MPEG 1 video data. 


Constrained_parameter_flag 

If MPEG_1_only_flag is a “O,” this bit must be a 
“1.” If MPEG_1_only_flag is a “1,” this bit 
reflects the value of the 
constrained_parameter_flag in the MPEG 1 
video stream. 


Still_picture_flag 

A “1” for this bit indicates the video stream 
contains only still pictures. A “0” indicates the 
video stream may have either still or moving 
pictures. 
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Profile_and_level_indication 

This 8bit codeword reflects the same or 
higher profile and level as indicated by the 
profile_and_level_indication field in the MPEG 
2 video stream. This field is present only if 
MPEG_1_only_flag = “0.” 


Chroma_format 

This 2-bit codeword reflects the same or 
higher chroma format as indicated by the 
chroma_format field in the MPEG 2 video 
stream. This field is present only if 
MPEG_1_only_flag = “0.” 


Frame_rate_extension_flag 

A “1” for this bit indicates that either or both of 
the frame_rate_extension_n and 
frame_rate_extension_d fields in any MPEG 2 
video stream are non-zero. This field is present 
only if MPEG_1_only_flag = “0.” 


Reserved 
These 5 bits are always “00000.” This field is 
present only if MPEG_1_only_flag = “0.” 


Audio Stream Descriptor 


This descriptor provides basic information 
about the audio stream. 


Descriptor_tag 
This 8-bit field has a value of “0000 0011.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Free_format_flag 
A “1” for this bit indicates the bitrate_index 
field in the audio stream is “0000.” 


Indicated Frame May Also Include Code 
Rate These Frame Rates 
forbidden 0000 
23.976 0001 
24.0 23.976 0010 
25.0 0011 
29.97 23.976 0100 
30.0 23.976, 24.0, 29.97 0101 
50.0 25.0 0110 
59.94 23.976, 29.97 0111 
60.0 23.976, 24.0, 29.97, 30.0, 59.94 1000 
reserved 1001 
reserved 1111 


Table 13.46. frame_rate_code Codewords. 
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ID 
This bit is set to the same value as the ID field 
in the audio stream. 


Layer 

This 2-bit binary number is set to the same or 
higher value as the highest layer in any audio 
stream. 


Variable_rate_audio_indicator 

A “0” for this bit indicates that the bit rate of 
the audio stream may vary between audio 
frames. 


Reserved 
These 3 bits are always “000.” 


Video Window Descriptor 


This descriptor provides a method to indicate 
information about the window characteristics 
of the video stream. 


Descriptor_tag 
This 8-bit field has a value of “0000 1000.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Horizontal_offset 

This 14-bit binary number indicates the hori- 
zontal position of the top left pixel of the video 
window on the target background grid. 


Vertical_offset 

This 14-bit binary number indicates the verti- 
cal position of the top left pixel of the video 
window on the target background grid. 


Window_priority 

This 4-bit binary number indicates how video 
windows overlap. A value of “0000” is lowest 
priority and “1111” is highest priority. Higher 
priority windows are visible over lower priority 
windows. 


Hierarchy Descriptor 


This descriptor provides a method to indicate 
information about hierarchically coded audio 
and video which are multiplexed into multiple 
streams. 


Descriptor_tag 
This 8-bit field has a value of “0000 0100.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Reserved 
These four bits are always “0000.” 


Hierarchy_type 

This 4-bit codeword indicates the relationship 
between the hierarchy layer and its embedded 
layer as shown in Table 13.47. 
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Hierarchy Type Code 
reserved 0000 
spatial scalability 0001 
SNR scalability 0010 
temporal scalability 0011 
data partitioning 0100 
extension bitstream 0101 
private bitstream 0110 
reserved 0111-1110 
base layer 1111 


Table 13.47. hierarchy_type Codewords. 


Reserved 
These two bits are always “00.” 


Hierarchy_layer_index 
This 6-bit binary number indicates a unique 
index of the stream. 


Reserved 
These two bits are always “00.” 


Hierarchy_embedded_layer_index 

This 6-bit binary number defines the hierarchi- 
cal table index of the stream that must be 
accessed prior to decoding. This parameter is 
undefined for a hierarchy_type value of “1111.” 


Reserved 
These two bits are always “00.” 


Hierarchy_channel 

This 6bit binary number indicates the 
intended channel number. The most robust 
channel has the lowest value. 


Maximum Bitrate Descriptor 


This descriptor provides a method to indicate 
information about the maximum bit rate 
present. 


Descriptor_tag 
This 8-bit field has a value of “0000 1110.” 


Descriptor_length 
This 8-bit binary number specifies the number 
of bytes following this field. 


Reserved 
These two bits are always “00.” 


Maximum_bitrate 

This 22-bit binary number indicates the maxi- 
mum bit rate present, in units of 50 bytes per 
second. 


Private Data 


Private data may be contained within the PES 
packet header, as indicated by the optional 16 
bytes of PES_private_data. 

Private data may also be contained within 
the PES_packet_data_byte fields. stream_type is 
used to indicate private data is present. 

Private descriptors, previously discussed, 
are also available. 


Video Decoding 639 


Video Decoding 


The video decoder essentially performs the 
inverse function as the encoder. From the 
coded bitstream, it reconstructs the I frames. 
Using I frames, additional coded data, and 
motion vectors, the P and B frames are gener- 
ated. Finally, the frames are output in the 
proper order. 

Figure 13.24 illustrates the block diagram 
of a basic MPEG 2 video decoder. Figure 13.25 
illustrates the block diagram of a MPEG 2 
video decoder that supports SNR scalability. 
Figure 13.26 illustrates the block diagram of a 
MPEG 2 video decoder that supports temporal 
scalability. 


Audio/Video Synchronization 


The PCR (program reference clock) is usually 
used for synchronization. Its arrival time is 
determined by the encoding process and the 
broadcast path. The video decoder synchro- 
nizes its system time clock (STC) and the 27- 
MHz display clock to the PCR. If synchroniza- 
tion starts to drift, the video decoder’s buffer 
either may overflow or run out of compressed 
data. The video decoder prevents this by sens- 
ing the loss of synchronization, and then either 
dropping or repeating pictures to resynchro- 
nize to the PCR values. 


Coarse Synchronization 


Coarse synchronization may be done without 
using a PLL. The STC can be made to stay in 
approximate synchronization with the PCR by 
continuously refreshing its contents based on 
the value of arriving PCR values. 

Since the 27-MHz display clock is not syn- 
chronized with the PCR, the video decoder’s 
PTS value (measured by the decoder’s STC 


from the timing of the VSYNC# pulses) will 
drift with respect to the received PTS. If the 
PTS is not within 22.5 ms of the decoder’s 
STC, the video decoder either skips the next B 
picture or repeats the current picture. Audio 
may be synchronized by monitoring the STC 
from the video decoder and adjusting the audio 
accordingly. 


Fine Synchronization 


Fine synchronization may be achieved by 
locking the video decoder’s 27-MHz display 
clock to the PCR using a PLL. The display 
clock is used to control the output of the video 
decoder. 

The video decoder has an internal counter 
that implements the system time clock (STC). 
This counter has a resolution of 90 kHz and is 
driven by the display clock. 

The STC is initialized at the beginning of 
the decoding process. Since the display clock 
is synchronized to the PCR, the STC will track 
PCR changes, except when the PCR changes 
rapidly, such as when changing a channel. 
Using the STC to measure the timing of the 
display VSYNC# pulses, and comparing that 
timing to a stored PTS value, enables the video 
decoder to choose the appropriate moment to 
decode and display a particular video picture. 
Audio synchronization can be handled by lock- 
ing the audio subsystem to the display clock. 
Audio PTS values then are used to play the 
audio. 


Lip Sync Issues 


Lip sync is an implementation issue that has 
nothing at all to do with the MPEG standard. 
Assuming that the audio and video are in good 
sync at the encoder input, the MPEG system of 
program clock reference (PCR) and presenta- 
tion time stamps (PTS) provides the tools to 
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Figure 13.24. Simplified MPEG 2 Decoder Block Diagram. 
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Figure 13.25. Simplified MPEG 2 SNR Scalability Decoder Block Diagram. 
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Figure 13.26. Simplified MPEG 2 Temporal Scalability Decoder Block Diagram. 


maintain the audio/video timing relationship. 
Decoders, and possibly encoders, with incor- 
rect implementations of these tools have been 
manufactured. 

A representative example of an incorrect 
implementation in decoders has to do with the 
reading of the PTS. Transmission of PTS is 
required at intervals of no greater than 700 
milliseconds. Some decoders read the PTS 
when the station is first tuned in, then incor- 
rectly “freewheel” afterwards on the basis of 
temporal reference values, never checking a 
PTS again until the channel is changed or 
turned on again. Some, incorrectly, completely 
ignore the temporal reference values and 
instead look at the time stamps. As time goes 
by, early implementation mistakes are being 
corrected, reducing the occurrence of lip sync 
problems. 


Testing Issues 


Since they are so complex, MPEG decoders 
have special testing challenges. Unlike linear 
systems, the output from a MPEG decoder is 
not only dependent on the input at any given 


point in time, but also the previous state of the 
system before the input was received. Since 
the output is the result of many combinations 
of inputs, the number of output states to be 
tested rapidly grows beyond reason. 

The question is how to measure the output 
of the MPEG decoder. Since the original image 
has been encoded and decoded, there will be 
differences between the original and decoded 
images. Some high-frequency information may 
be lost, and there may be compression arti- 
facts in the output. For this reason, direct 
numerical comparison between an original and 
decoded image is not practical. 

Alternately, the output of the decoder 
could be compared to the output of a reference 
decoder. However, the inverse discrete cosine 
transfer (IDCT) accuracy must be taken into 
account. Different IDCT implementations may 
have slightly different outputs from the same 
input. Also, additional differences may occur if 
the encoder and decoder use different IDCT 
implementations. 

The predictive nature of MPEG can be 
used to test the proper operation of the 
decoder. This technique does not require inter- 
nal access to the decoder or numerical compar- 
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isons of the output to a reference. Instead, the 
output image is designed in such a way that it 
will tend to highlight decoder errors. If the 
decoder is working properly, the result is an 
image that you can visually inspect. 

The best approach is to test the range of 
MPEG features one function at a time, process- 
ing a number of test bitstreams one after 
another. Limiting the scope of each test makes 
it easier to pinpoint problems. The tester can 
also quickly determine which (if any) MPEG 
syntax the decoder fails to correctly process. 


Encoder Bitstreams Not Adequate 


Decoder testing should include the use of “typ- 
ical” video sequences produced by one or 
more production encoders. These sequences 
give an indication of how the decoder will look 
in normal use. They are not, however, suffi- 
cient to ensure confidence that it will work 
with all bitstreams. 


Syntax Testing 


MPEG is a standard that defines the syntax 
that a decoder must recognize and process, 
but it does not define what syntax an encoder 
must use to generate a bitstream. 

Most encoders use only a part of the 
MPEG syntax. As a result, a decoder that 
works with one encoder may not work with 
another one that uses different MPEG fea- 
tures. Decoders must be adequately tested to 
ensure that they will work with future encod- 
ers. The easiest way to ensure this is to test the 
decoder using all the various syntax elements. 
All conditions that are permitted by a given 
syntax should be tested. 


The development of test bitstreams should 
be separated from the development of the 
decoder. If the team that develops the decoder 
also develops the test bitstreams, misunder- 
standings about the MPEG specification 
details could be built into the tests. 

The test bitstreams should also make it 
obvious if there are display problems. This 
includes distinctive marks at each corner of 
the image to ensure that the entire image is 
displayed. Smooth horizontal and_ vertical 
motion helps verify field order and alignment. 
Each frame should also be unique so that it is 
visually obvious if a frame has been skipped. 


More than Just Video 


So far, only video has been discussed. How- 
ever, video is multiplexed with audio, data, and 
timing information into a program or transport 
stream. Testing at these levels is also needed. 

Testing the audio/video synchronization 
requires a bitstream with known relationships 
between unique audio and video events. The 
test should be designed to facilitate both 
instrument measurement and visual inspec- 
tion. A shaped audio pulse correlated to a spe- 
cific video event works well. 

Testing at the program or transport layer 
must also include checking the ability to 
receive and utilize the system information that 
is multiplexed into the bitstream. This includes 
the program association table and program 
map table, necessary for the decoder to find 
the individual bitstreams that comprise a pro- 
gram. There may also be program guides, rat- 
ing tables, conditional access tables, and other 
information that should be tested. 
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Pushing the Limits 


So far, we have discussed testing one or a two 
things at a time. This may help isolate prob- 
lems and decrease debugging time, but will 
not answer how a decoder will work under 
real-world conditions. 

Since MPEG is statistical in nature, the 
information sent to the decoder is not at a con- 
stant rate. What happens when many DCT 
coefficients are received in a short period of 
time, many MPEG features are enabled simul- 
taneously, and a large percentage of the bits in 
a frame occur in a single slice? A thorough test- 
ing strategy must check a decoder’s ability to 
handle bitstreams that push multiple elements 
to the limit. 

It is also important to see how a decoder 
recovers from errors, since it is likely to occa- 
sionally receive noncompliant bitstreams. Will 
it freeze or recover gracefully? How long does 
it take to recover? 

If error concealment has been designed 
into the system, it should be tested by bit- 
streams with known errors. The output can 
then be checked to see how effectively the con- 
cealment works. 
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Digital Television 
(DTV) 


Although digital television (DTV) has only 
recently been introduced to the market, it has 
been in development for many years. Out of 
these major efforts, two major solutions have 
emerged: ATSC (Advanced Television Sys- 
tems Committee) and DVB (Digital Video 
Broadcast). 

A comparison between the various stan- 
dards is shown in Table 14.1. The basic audio 
and video capabilities are very similar. The 
major differences are the RF modulation 
schemes (and available bit rates) used for over- 
the-air transmission, and the level of definition 
for non-audio/video services. 


ATSC 


The ATSC standard is actually a group of stan- 
dards: 


A/52: Digital Audio Compression Standard 
(AC-3) 


A/53: ATSC Digital Television Standard 
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A/54: Guide to the Use of the ATSC Digital 
Television Standard 


A/57: Program/Episode/Version Identifi- 
cation 


A/58: Harmonization With DVB SI in the 
Use of the ATSC Digital Television 
Standard 


A/64: Transmission Measurement and 
Compliance for Digital Television 


A/65: Program and System Information 
Protocol for Terrestrial Broadcast 
and Cable 


A/70: Conditional Access System for Ter- 
restrial Broadcast 


A/80: Modulation and Coding Require- 
ments for Digital TV (DTV) Applica- 
tions Over Satellite 


A/90: ATSC Data Broadcast Standard 
The ATSC standard transmits compressed 


digital video, compressed digital audio, and 
ancillary data over a single 6 MHz channel. 
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The system delivers up to about 19.4 Mbps of 
data in a 6 MHz terrestrial (over-the-air) chan- 
nel and up to about 38.8 Mbps of data in a 6 
MHz cable channel. This enables one HDTV 
program to be transmitted in a conventional 6 
MHz terrestrial channel. 


Video Capability 


Although any resolution may be used as long 
as the bit rate is not exceeded, there are sev- 
eral resolutions that will probably be the most 
popular. These are indicated in Table 14.2. 
Both interlaced and progressive pictures are 
permitted for most of the resolutions. 

Video compression is based on MPEG 2. 
However, there are some minor constraints on 
some of the MPEG 2 parameters, as discussed 
within the chapter on MPEG 2. 


Audio Capability 


Audio compression is implemented using 
Dolby Digital (AC-3). 

The main audio, or associated audio which 
is a complete service (containing all necessary 
program elements), has a bit rate <384 kbps. A 
single channel associated service containing a 
single program element has a bit rate <128 
kbps. A two channel associated service con- 
taining only dialogue has a bit rate <192 kbps. 
The combined bit rate of a main and associated 
service which are intended to be decoded 
simultaneously must be <512 kbps. There is 
work underway to increase the 5.1 channel 
maximum bit rate from 384 to 440 kbps, which 
matches the capability of DVD and permits the 
transmission of a less compressed 5.1 audio 
signal. 

There are several types of audio service 
defined: 


Parameter ATSC DVB ISDB* OpenCable 
video compression MPEG 2 
audio compression Dolby Digital MPEG, Dolby Digital Dolby Digital 
system bitstream format MPEG 2 transport stream 
RF transmission: terrestrial 8-VSB COFDM BST-OFDM2 - 
are 3 16-QAM, 32-QAM, 64-QAM, 64-QAM, 
RF transmission: cable 16-VSB 128-QAM, 256-QAM 256-QAM 
Sie : QPSK, 8-PSK, 
RF transmission: satellite 16-QAM QPSK - 
RF channel bandwidth 6 MHz 6, 7, or 8 MHz 6 MHz 
Notes: 


1. ISDB = Integrated Services Digital Broadcasting, to be used in Japan. 
2. BST-OFDM = Bandwidth Segmented Transmission of OFDM. 
3. Most major cable companies have adopted QAM over 16-VSB. 


Table 14.1. Comparison of the Various DTV Standards. 
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Refresh Rate ee 
A i p = progressive, i = interlace 
cs _ eee 23.976p 29.97p ' 59.941 | 59,95p 
24p 25p 30p 50 B0p 60i 60p 
352 x 480 x x x x 
480 x 480 x x x x 
480 x 576 x x x x 
528 x 480 x x x x 
544 x 480 apy x x x x 
544 x 576 x x x x 
640 x 480 x x x x 
704 x 480 x x x x 
720 x 576 x x x x 
1280 x 720 x x x x x 
1280 x 1080 x x x x x 
1440 x 1080 me x x x x x 
1920 x 1080 x x x x x 


Table 14.2. Common Active Resolutions for Digital Television. 


Complete Main Audio Service (CM) 

This type of main audio service contains a 
complete audio program (dialogue, music, and 
effects). This is the type of audio service nor- 
mally provided, and may contain 1-5.1 audio 
channels. 

The CM service may be further enhanced 
by using the VI, HI, C, E, or VO associated ser- 
vices. Audio in multiple languages may be pro- 
vided by supplying multiple CM services, each 
in a different language. 


Main Audio Service, Music and Effects 
(ME) 

This type of main audio service contains the 
music and effects of an audio program, but not 


the dialogue. It may contain 1-5.1 audio chan- 
nels. The primary program dialogue (if any 
exists) is supplied by a D service. 


Visually Impaired (VI) 

This service typically contains a narrative 
description of the program content. In this 
case, the VI service uses a single audio chan- 
nel. The simultaneous decoding of both the VI 
and CM allows the visually impaired to enjoy 
the program. 

Besides providing VI as a single narrative 
channel, it may be provided as a complete pro- 
gram mix containing music, effects, dialogue, 
and the narration. In this case, the service may 
use up to 5.1 channels. 
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Hearing Impaired (HI) 

This service typically contains only dialogue 
which is intended to be reproduced simulta- 
neously with the CM service. In this case, HI is 
a single audio channel. 

Besides providing HI as a single dialogue 
channel, it may be provided as a complete pro- 
gram mix containing music, effects, and dia- 
logue. In this case, the service may use up to 
5.1 channels. 


Dialogue (D) 

This service contains program dialogue 
intended for use with the ME main audio ser- 
vice. 

A complete audio program is formed by 
simultaneously decoding both the D and ME 
services and mixing the D service into the cen- 
ter channel of the ME main service (with 
which it is associated). 

If the ME service contains more than two 
audio channels, the D service is monophonic. 
If the ME service contains two channels, the D 
service may also contain two channels. In this 
case, a complete audio program is formed by 
simultaneously decoding the D and ME ser- 
vices, mixing the left channels of the ME and 
D service, and mixing the right channels of the 
D and ME service. The result will be a two- 
channel stereo signal containing music, 
effects, and dialogue. 

Audio in multiple languages may be pro- 
vided by supplying multiple D services (each 
in a different language) along with a single ME 
service. This is more efficient than providing 
multiple CM services. However, in the case of 
more than two audio channels in the ME ser- 
vice, this requires that the dialogue be 
restricted to the center channel. Some receiv- 
ers may not have the capability to simulta- 
neously decode ME and D services. 


Commentary (C) 

This service is similar to the D service, except 
that instead of conveying essential program 
dialogue, it conveys an optional program com- 
mentary using a single audio channel. 

In addition, it may be provided as a com- 
plete program mix containing music, effects, 
dialogue, and the commentary. In this case, 
the service may use up to 5.1 channels. 


Emergency (FE) 

This service is intended to allow the insertion 
of emergency or high priority announcements, 
and is always a single audio channel. 

E service is given priority in transport and 
audio decoding. Whenever it is present, the 
audio decoder stops decoding any other ser- 
vice being received and only decodes the E 
service into the center channel (or left and 
right channels if a center loudspeaker does not 
exist). 


Voice-Over (VO) 

This service is a _ single-channel service 
intended to be decoded and mixed with the 
ME main audio service. It has second priority 
(only the E service has higher priority). 


Closed Captioning and Emergency 
Messages 


Closed captioning (EJA-708) and emergency 
messages may be added to the MPEG 2 video 
stream through the use of a user data exten- 
sion. This extension data may be inserted at 
the sequence, GOP and picture layer. 

The bitstream syntax is: 


User_data_start_code 
This 32-bit string has a value of 000001B2y. 
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ATSC_identifier 

This 32-bit value of 47413934, indicates that 
the user data conforms to the ATSC specifica- 
tion. 


User_data_type_code 
The value of this 8-bit codeword is 03. 


Process_em_data_flag 

If this bit is a “1,” the em_data must be pro- 
cessed. If it is a “O,” the em_data can be dis- 
carded. 


Process_cc_data_flag 

If this bit is a “1,” the cc_data must be pro- 
cessed. If it is a “0,” the cc_data can be dis- 
carded. 


Additional_data_flag 
This flag is set to a “1” to indicate the presence 
of additional user data. 


Cc_count 

This 5-bit binary number, with a range of 0-31, 
indicates the number of closed caption con- 
structs following this field. The value is set 
such that a fixed bandwidth of 9600 bps is 
maintained for the closed caption data. 


Em_data 
These eight bits contain an emergency mes- 
sage. They are processed’ only if 


process_em_data_flag is a “1.” 
Note: [cc_count — 1] specifies how many times 
the following five fields are repeated. 


Marker_bits 
These five bits are always “11111.” 


Cc_valid 

If this bit is a “1,” the two closed caption data 
bytes are valid. If it is a “0,” the two data bytes 
are invalid. 


Cc_type 
These two bits specify the type of closed cap- 
tion data that follows. 


Cc_data_1 

The first eight bits of closed caption data. They 
are processed only if process_cc_data_flag is a 
“1.” 


Cc_data_2 

The second eight bits of closed caption data. 
They are processed only if process_cc_data_flag 
isa “1.” 


Marker_bits 
These eight bits are always “11111111.” 


Additional_user_data 
These optional eight bits are present if 
additional_data_flag = “1.” 


Program and System Information 
Protocol (PSIP) 


Enough bandwidth is available within the 
MPEG 2 transport stream to support several 
low-bandwidth non-television services such as 
program guide, closed captioning, weather 
reports, stock indices, headline news, software 
downloads, pay-per-view information, etc. The 
number of additional non-television services 
(virtual channels) may easily reach ten or 
more. In addition, the number and type of ser- 
vices will be constantly changing. 
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To support these non-television services in 
a flexible, yet consistent, manner, the Program 
and System Information Protocol (PSIP) was 
developed. PSIP is a small collection of hierar- 
chically associated tables designed to fit within 
the MPEG 2 transport stream. It describes the 
information for all virtual channels carried in a 
particular MPEG 2 transport stream. Addition- 
ally, information for analog broadcast channels 
may be incorporated. 


Required Tables 

The Master Guide Table (MGT) provides gen- 
eral information about all other tables. It 
defines table sizes necessary for memory allo- 
cation during decoding, version numbers to 
identify tables that need updating, and assigns 
packet identifiers (PIDs) that label the tables. 
The MGT is required to be present in the 
MPEG 2 transport stream. 

The System Time Table (STT) serves as a 
reference for the time of day. Receivers can 
use it to maintain the correct local time. The 
STT is required to be present in the MPEG 2 
transport stream. 

As an example of the bitstream syntax for 
tables, the syntax for the System Time Table 
(STT) is: 


Table_id 
This 8-bit value of CDy identifies the table as a 
System Time Table. 


Section_syntax_indicator 

This bit has a value of “1” to indicate that the 
section that follows has a generic section syn- 
tax beyond the section_length field. 


Private_indicator 
This bit always has a value of “1.” 


Reserved_bits 
These two bits always have a value of “11.” 


Section_length 

This 12-bit binary number specifies the num- 
ber of bytes following this field up to the end of 
the section. The maximum value is 1021. 


Table_id_extension 
This 16-bit string has a value of 0000y. 


Reserved_bits 
These two bits always have a value of “11.” 


Version_number 
This 5-bit binary number has a value of 
“00000.” 


Current_next_indicator 
This bit is always a “1” for a STT, indicating the 
STT data is applicable. 


Section_number 

This 8bit binary number has a value of 004, 
indicating that this table is only one section 
long. 


Last_section_number 
The 8&-bit binary number has a value of 004. 


Protocol_version 
This 8-bit binary number has a value of 00q. 


System_time 

A 32-bit binary number indicating the current 
system time as the number of GPS seconds 
since 12am, January 6th, 1980. 
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GPS_UTC_offset 

An 8-bit binary number that indicates the cur- 
rent offset in whole seconds between the GPS 
and UTC time standards. 


Daylight_savings 
16 bits of daylight savings time control infor- 
mation. 


Descriptors 

One or more descriptors may be present to 
convey additional data or information to the 
receiver. 


CRC_32 
This is a 32-bit CRC value for the table. 


The Rating Region Table (RRT) transmits 
the rating standard in use for each country. 
Provisions are made for different rating sys- 
tems for different countries and multi-country 
regions as well. The RRT is required to be 
present in the MPEG 2 transport stream. 

The Virtual Channel Table (VCT) contains 
a list of all the channels that are or will be avail- 
able plus their attributes. Attributes include 
the channel name, navigation identifiers, 
stream components and types, etc. The VCT is 
required to be present in the MPEG 2 trans- 
port stream. 

There are multiple Event Information 
Tables (EIT), each of which describes the 
events or TV programs associated with each 
virtual channel listed in the VCT. Each EIT is 
valid for three hours. Since there are up to 128 
EITs, up to 16 days of programming may be 
advertised in advance. The first four EITs are 
required to be present in the MPEG 2 trans- 
port stream. Note that this table is not 
required to be present for cable systems. 


Optional Tables 

For text messages, there can be _ several 
Extended Text Tables (ETTs), each having its 
PID defined by the MGT: Messages can 
describe channel information, coming attrac- 
tions, movie descriptions, etc. Each EIT and 
VCT can each have one ETT. 

The Directed Channel Change Table 
(DCCT) contains information needed for a 
channel change to be done at a broadcaster- 
specified time. The requested channel change 
may be unconditional or may be based upon 
criteria specified by the viewer. 

The Directed Channel Change Selection 
Code Table (DCCSCT) permits a broadcast 
program categorical classification table to be 
downloaded for use by some Directed Channel 
Change requests. 


Descriptors 

Much like MPEG 2, the ATSC standard uses 
descriptors to add new functionality. One or 
more of these descriptors may be included 
within the PSIP and MPEG 2 tables (see Table 
14.3) to extend data within the tables. A 
descriptor not recognized by a decoder must 
be ignored by that decoder. This enables new 
descriptors to be implemented without affect- 
ing receivers that cannot recognize and pro- 
cess the descriptors. 

The caption service descriptor provides 
EJA-708 closed captioning data, including 
closed captioning type, language code, and 
widescreen formatting. Up to 16 closed cap- 
tioning services may accompany a program. 

The content advisory descriptor indicates, 
for a given program, the ratings for the rating 
standards defined in the RRT: Ratings may be 
given for any or all defined regions, up to a 
maximum of eight regions per program. 

The extended channel name descriptor pro- 
vides a variable-length channel name for the 
virtual channel. 
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i Terrestrial Broadcast Tables Cable Broadcast Tables 
7 Descriptor 
Descriptor Tag 
PMT MGT VCcT EIT PMT MGT vVCcT EIT 

stuffing 1000 0000 * * * : *. ig * 
AC-3 1000 0001 M M M O 
caption service 1000 0110 O M M O 
content advisory 1000 0111 O M M O 
extended channel 1010 0000 M M 
name 
service location 1010 0001 S M 
Snieyslutied 1010 0010 M M 
service 
component name 1010 0011 M M 

1100 0000 
user private to = * = * * * i * 

1111 1110 


Notes: 
1. PMT: MPEG 2 Program Map Table. 


2. M = when present, required in this table. S = always present in this table for terrestrial broadcast. 
O = may be present in this table also. * = no restrictions. 


Table 14.3. List of Descriptors for PSIP Tables. 


The component name descriptor defines a 
variable-length text-based name for any com- 
ponent of the service. 

The stuffing descriptor is not processed, 
but simply defines a block of [n] bytes as a 
placeholder. 

The time-shifted service descriptor links one 
virtual channel with up to 20 other virtual 
channels carrying the same programming, but 
time-shifted. A typical application is for Near 
Video On Demand (NVOD) services. 

As an example of the bitstream syntax for 
descriptors, the syntax for the Time-Shifted 
Service Descriptor is: 


Descriptor_tag 
This 8-bit value of A2y identifies the descriptor 
as a time-shifted service descriptor. 


Descriptor_length 

This 8bit binary number specifies the length 
(in bytes) of this field up to the end of the 
descriptor. 


Reserved_bits 
These three bits always have a value of “111.” 


Number_of_services 

A 5-bit binary number, with a range of 1-20, 
that indicates the number of time-shifted ser- 
vices defined. 
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Note: [number_of_services — 1] specifies how 
many times the following five fields are repeated. 


Reserved_bits 
These six bits always have a value of “111111.” 


Time_shift 

A 10-bit binary number, with a range of 1-720, 
that indicates the number of minutes the time- 
shifted service is time-shifted. 


Reserved_bits 
These four bits always have a value of “1111.” 


Major_channel_number 

A 10-bit binary number, with a range of 1-999, 
indicating the “major” channel number associ- 
ated with the time-shifted service. 


Minor_channel_number 

A 10-bit binary number, with a range 0-999. 
that (when non-zero) indicates the “minor” 
channel number associated with the time- 
shifted service. 


Adding Future Data Services 


The MPEG 2 transport layer used by the ATSC 
standard allows new ancillary services to eas- 
ily be added in the future. These ancillary ser- 
vices are transmitted as private data, and are 
supported in two bitstream locations: 


1. Within the adaptation header of trans- 
port packets 


2. As a separate transport stream 


Terrestrial Transmission Format 


The VSB (vestigial sideband) modulation sys- 
tem offers two modes: a terrestrial broadcast 
mode and a high data rate mode. The terres- 
trial broadcast mode provides maximum cover- 
age, supporting one HDTV signal or multiple 
SDTV signals in a 6 MHz channel. The high 
data rate mode supports two HDTV signals or 
multiple SDTV signals in a 6 MHz channel, 
trading off robustness for a higher data rate. 
The two modes share the same pilot, symbol 
rate, data frame structure, interleaving, Reed- 
Solomon coding, and synchronization pulses. 

VSB modulation requires only processing 
the in-phase (I) channel signal, sampled at the 
symbol rate. The decoder only requires one 
ADC and a real (not complex) equalizer operat- 
ing at the symbol rate of 10.76M samples per 
second. 


8-VSB Overview 
Figure 14.1 illustrates a simplified block dia- 
gram of the 8-VBS modulation process. 


Data Synchronization 

Initially, an 8-VSB (8level vestigial sideband) 
modulator must synchronize to the incoming 
MPEG 2 transport bitstream packets. Before 
any processing can be done, it must correctly 
identify the start and end points of each MPEG 
2 packet by using its sync byte. This sync byte 
is then discarded and replaced with an ATSC 
segment sync later in the processing stage. 


Data Randomizer 

In general, the 8-VSB signal must have a ran- 
dom, noise-like nature. This results in a flat, 
noise-like spectrum that maximizes channel 
bandwidth efficiency and reduces possible 
interference with other DTV and NTSC chan- 
nels. 
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MPEG 2 REED 
FRAME DATA DATA 
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Figure 14.1. Simplified Block Diagram of a 8-VSB Modulator. 


To accomplish this, the data randomizer 
uses a pseudo-random number generator to 
scramble the bitstream. The receiver reverses 
this process to recover the original data. 


Reed-Solomon Encoding 

Reed-Solomon (RS) encoding provides for- 
ward error correction (FEC) for the data 
stream. 

The RS encoder takes the 187 bytes of an 
incoming MPEG 2 packet (remember, the 
packet sync byte has been removed), calcu- 
lates 20 parity bytes (known as Reed-Solomon 
parity bytes), and adds them to the end of the 
packet. 

The receiver compares the received 187 
bytes to the 20 parity bytes in order to deter- 
mine the validity of the recovered data. If 
errors are detected, the receiver can use the 
parity bytes to locate and correct the errors. 


Data Interleaver 

The data interleaver scrambles the sequential 
order of the data stream in order to minimize 
sensitivity to burst-type interference. It does 
this by assembling new data packets that incor- 
porate portions of many different MPEG 2 
packets. These new data packets are the same 
length as the original MPEG 2 packets: 207 
bytes after RS coding. 


Data interleaving is done according to a 
known pattern. The process is reversed in the 
receiver in order to recover the proper data. 


Trellis Encoder 

Trellis coding is another form of forward error 
correction, known as convolutional coding, 
that tracks the data stream over time. Twelve 
different Trellis encoders operate in parallel, 
providing another form of interleaving to pro- 
vide further protection against burst-type inter- 
ference. 

In general, each &-bit data word is split into 
a stream of four, 2-bit words. A 3-bit binary 
code is mathematically generated to describe 
the transition from previous 2-bit words to the 
current one. For every two bits that go into the 
trellis coder, three bits come out. For this rea- 
son, the trellis coder in the 8-VSB system is 
said to be a 2/3 rate coder. 

These 3-bit codes are substituted for the 
original 2-bit words and transmitted as one of 
the eight level symbols of 8-VSB. 

The receiver’s trellis decoder uses the 3-bit 
transition codes to reconstruct the evolution of 
the data stream from one 2-bit word to the 
next. 
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Sync and Pilot Insertion 

The next step is the insertion of various 
“helper” signals (ATSC pilot, segment sync, 
and field sync) that aid the 8-VSB receiver to 
locate and demodulate the transmitted signal. 
These signals are inserted after the randomiza- 
tion and error coding stages so as not to 
destroy the relationships that these signals 
must have in order to be effective. 

The first “helper” signal is the ATSC pilot, 
which gives the 8-VSB receiver something to 
lock on to independent of the transmitted data. 
Just before modulation, a small DC offset is 
added to the 8VSB baseband signal. This 
causes a small residual carrier to appear at the 
zero frequency point of the resulting modu- 
lated spectrum. This is the ATSC pilot. 

Another “helper” signal is the ATSC seg- 
ment sync. A data segment is composed of the 
207 bytes of an interleaved data packet. After 
trellis coding, the 207-byte segment has been 
converted into a baseband stream of 828 eight- 
level symbols. The ATSC segment sync is a 
four-symbol pulse that is added to the front of 
each data segment to replace the missing sync 
byte of the original MPEG 2 data packet. The 
segment sync thus appears once every 832 
symbols. 8-VSB receivers can detect the repeti- 
tive nature of the ATSC segment sync, recover 
it, and use it to help regenerate the sample 
clock. 

The last “helper” signal is the ATSC field 
sync. 313 consecutive data segments comprise 
a data field. The ATSC field sync is one data 
segment that is added at the beginning of each 
data field. It has a known symbol pattern and is 
used by the 8-VSB receiver to eliminate signal 
ghosts caused by reflections. This can be done 
by comparing the received ATSC field sync 
against the known field sync sequence. The 
difference is used to adjust the taps of the 
receiver’s ghost-canceling equalizer. 


AM Modulation 

The eight-level baseband signal is then ampli- 
tude modulated onto an intermediate fre- 
quency (IF) carrier. This creates a large, 
double-sideband spectrum about the carrier 
frequency that is far too wide to be transmitted 
in a 6 MHz channel. A Nyquist VSB filter trims 
the bandwidth down to fit within the channel 
by removing redundant information. 

As a result of the overhead added to the 
data stream due to forward error correction 
coding and sync insertion, the bit rate has 
gone from 19.39 Mbps to 32.28 Mbps at the 
output of the trellis coder. Since three bits are 
transmitted in each symbol, the resulting sym- 
bol rate is 10.76 million symbols per second. 
This can be transmitted with a minimum fre- 
quency bandwidth of 5.38 MHz, which fits 
nicely within a6 Mhz channel bandwidth. 

After the Nyquist VSB filter, the 8-VSB 
intermediate frequency (IF) signal is up-con- 
verted to the assigned UHF or VHF channel 
frequency. 


DVB 


The DVB standard is actually a group of many 
standards, the major ones being: 


ETSI EN 300 421: Framing Structure, 
Channel Coding and Modulation for 11/ 
12 GHz Satellite Services 


ETSI EN 300 429: Framing Structure, 
Channel Coding and Modulation for 
Cable Systems 


ETSI EN 300 468: Specification for Service 
Information (SI) in DVB Systems 
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ETSI EN 300 472: Specification for Convey- 
ing ITU-R System B Teletext in DVB Bit- 
streams 


ETSI EN 300 744: Framing Structure, 
Channel Coding and Modulation for Digi- 
tal Terrestrial Television 


ETSI EN 301 192: Specification for Data 
Broadcasting 


ETSI EN 301 775: Standard for Conveying 
VBI Data in DVB Bitstreams 


ETSI ETS 300 743: Subtitling Systems 


Like the ATSC standard, DVB transmits 
compressed digital video, compressed digital 
audio, and ancillary data. Terrestrial channel 
bandwidths of 6, 7, and 8 MHz are supported 
to address the needs of different countries that 
may adopt the standard. 


Video Capability 


Although any resolution may be used as long 
as the bit rate is not exceeded, there are many 
resolutions that will probably be the most pop- 
ular. These are indicated in Table 14.2. Both 
interlaced and progressive pictures are permit- 
ted for most of the resolutions. 

Video compression is based on MPEG 2. 
However, there are some minor constraints on 
some of the MPEG 2 parameters, as discussed 
within the chapter on MPEG 2. 


Audio Capability 


Audio compression standards supported 
include Dolby Digital (AC-3), MPEG 1 Layers 
1 and 2, and MPEG 2 Layers 1 and 2. 


Subtitles 


Subtitles enable specified text or bit-mapped 
graphics to appear on the screen at a specified 
time and location. ETSI ETS 300 743 defines 
how subtitles are implemented. 

A single PES packet may carry several dif- 
ferent subtitle streams, allowing support for 
multiple languages, 4:3 and 16:9 aspect ratio 
displays, etc. The PTS (Presentation Time 
Stamp) in the PES packet provides presenta- 
tion timing information for the subtitling data. 

The building block of the subtitling capa- 
bility is the subtitling_segment. These seg- 
ments are carried in MPEG 2 PES packets as 
private_stream_1 which are in turn carried by 
transport packets. 

The syntax of the PES data field in the sub- 
titling PES packets is: 


Data_identifier 
This 8-bit field has a value of 20} to indicate 
subtitling. 


Subtitle_stream_id 

This 8-bit binary number has a value of 00y, 
and identifies the subtitle stream in this PES 
packet. 


Subtitling_segment 
One or more subtitling segment fields are 
located here. 


End_of_pes_data_field_marker 
This 8-bit field has a value of FFy. 
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Subtitling segments include Page Compo- 
sition Segment, Region Composition Segment, 
Color Lookup Table (CLUT) Definition Seg- 
ment, Object Data Segment, and private data. 
Combined, these segments enable specified 
text or bit-mapped graphics to appear on the 
screen at a specified time and location. 


VBI Data 


ESTI EN 301 775 (which replaces ETSI EN 300 
472) defines how to transmit many types of 
NTSC/PAL VBI data, such as widescreen sig- 
nalling (WSS), closed captioning, teletext, and 
video program system (VPS). This VBI data is 
carried in MPEG 2 PES packets as 
private_stream_1 which are in turn carried by 
transport packets. 

In addition to the VBI data types reviewed 
here, there is also the ability to transmit up to 
720 samples of 8-bit Y data. This enables the 
transfer of VBI data not defined by the current 
standard. 


Closed Captioning 

This data field can be used to transfer the 16 
bits of captioning information that may be 
present on lines 21 and 284 of NTSC systems 
(EIA-608). The syntax of the closed captioning 
data field is: 


Data_unit_id 

This 8-bit binary number identifies the type of 
data present. For closed captioning data, it has 
a value of C5y. 


Data_unit_length 
This 8-bit binary number indicates the number 
of bytes following this field. 


Reserved_bits 
These two bits always have a value of “11.” 


Field_parity 
A “1” for this bit indicates field 1 (line 21) data; 
a “0” indicates field 2 (line 284) data. 


Line_offset 

This 5-bit binary number specifies the line 
number the caption data is to be inserted on 
for a NTSC video signal. Equipment will only 
generate captioning data fields with this 5-bit 
binary number equal to “01101” or 21p. Decod- 
ers need only implement line_offset when it is 
“01101,” they may ignore other lines. 


Closed_captioning_data_block 

This 16-bit field corresponds to the 16 bits of 
closed captioning data, excluding the clock 
run-in signal. 


EBU and Inverted Teletext 

This data field can be used to transfer the 42 
data bytes of teletext information that may be 
present in PAL systems (ETSI ETS 300 706). 
The syntax of the teletext data field is: 


Data_unit_id 

This 8-bit binary number identifies the type of 
data present. For teletext data, it has a value of 
024, 034, COy, or Cly. 


Data_unit_length 
This 8-bit binary number indicates the number 
of bytes following this field. 


Reserved_bits 
These two bits always have a value of “11.” 


Field_parity 
A “1” for this bit indicates field 1 data; a “0” 
indicates field 2 data. 
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Line_offset 

This 5-bit binary number specifies the line 
number the teletext data is to be inserted on 
for a PAL video signal. Only values of “00000” 
and “00111” to “10110” are valid. When 
field_parity = “0”, a value of 313p is added to 
the line_offset value to obtain the line number. 


Framing_code 

This 8-bit field specifies the framing code to be 
used for a PAL video signal. For EBU teletext, 
it has a value of “11100100.” For inverted tele- 
text, it has a value of “00011011.” 


Txt_data_block 
This 336-bit field corresponds to the 336 bits of 
teletext data, excluding the run-in and framing 
code sequence. 


Video Program System (VPS) 

This data field can be used to transfer the 13 
data bytes of video program system informa- 
tion that may be present on line 16 of PAL sys- 
tems (ETSI ETS 300 231). The syntax of the 
VPS data field is: 


Data_unit_id 

This 8-bit binary number identifies the type of 
data present. For VPS data, it has a value of 
C3y. 


Data_unit_length 
This 8-bit binary number indicates the number 
of bytes following this field. 


Reserved_bits 
These two bits always have a value of “11.” 


Field_parity 

Equipment will generate VPS data fields with 
this bit set to a “1.” Decoders need only imple- 
ment field_parity when it is a “1.” They may 
ignore packets when this bit is a “0.” 


Line_offset 

This 5-bit binary number specifies the line 
number the VPS data is to be inserted on for a 
PAL video signal. Equipment will only gener- 
ate VPS data fields with this 5-bit binary num- 
ber equal to “01000” or 16p. Decoders need 
only implement line_offset when it is “01000,” 
they may ignore other lines. 


Vps_data_block 

This 104-bit field corresponds to the 104 bits of 
wide-screen-signalling data, excluding the run- 
in and start code byte. 


Widescreen Signalling (WSS) 

This data field can be used to transfer the 14 
bits of widescreen signalling information that 
may be present on line 23 of PAL systems 
(ETSI EN 300 294). The syntax of the WSS 
data field is: 


Data_unit_id 

This 8-bit binary number identifies the type of 
data present. For WSS data, it has a value of 
C4y. 


Data_unit_length 
This 8-bit binary number indicates the number 
of bytes following this field. 


Reserved_bits 
These two bits always have a value of “11.” 
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Field_parity 

Equipment will generate WSS data fields with 
this bit set to a “1.” Decoders need only imple- 
ment field_parity when it is a “1.” They may 
ignore packets when this bit is a “0.” 


Line_offset 

This 5-bit binary number specifies the line 
number the WSS data is to be inserted on for a 
PAL video signal. Equipment will only gener- 
ate WSS data fields with this 5-bit binary num- 
ber equal to “10111” or 23p. Decoders need 
only implement line_offset when it is “10111.” 
They may ignore other lines. 


Wss_data_block 
This 14-bit field corresponds to the 14 bits of 
wide-screen-signalling data. 


Reserved_bits 
These two bits always have a value of “11.” 


Data Broadcasting 


ESTI EN 301 192 defines how to transmit vari- 
ous types of non-video and non-audio data over 
a DVB system. Examples for data broadcast- 
ing, also call “datacasting,” are the software 
downloads, delivery of Internet services using 
IP tunnelling, interactive TV, etc. 

Five different application areas with differ- 
ent broadcast requirements have been identi- 
fied. For each application area, a profile is 
defined. 


Data Piping 

This profile supports data broadcast services 
that use a simple, asynchronous, end-to-end 
delivery of data. Data is carried directly in the 
payloads of MPEG 2 transport stream packets. 


Data Streaming 

This profile supports data broadcast services 
that use a streaming-oriented, end-to-end deliv- 
ery of data in either an asynchronous, synchro- 
nous, or synchronized way. Data is carried in 
MPEG 2 PES packets. 


Multiprotocol Encapsulation 

This profile supports data broadcast services 
that use the transmission of datagrams of com- 
munication protocols. The transmission of dat- 
agrams is done by encapsulating the 
datagrams in MPEG 2 DSM-CC sections. 


Data Carousels 

This profile supports data broadcast services 
that use the periodic transmission of data mod- 
ules. These modules are of known sizes and 
may be updated, added to, or removed from 
the data carousel in time. Data is broadcast 
using a MPEG 2 DSM-CC Data Carousel. 


Object Carousels 

This profile supports data broadcast services 
that use the periodic broadcasting of DSM-CC 
User-User (U-U) Objects. Data is broadcast 
using the MPEG 2 DSM-CC Object Carousel 
and DSM-CC Data Carousel. 


Service Information (SI) 


ETSI EN 300 468 specifies the Service Infor- 
mation (SI) data which forms a part of DVB 
bitstreams. This provides the user with infor- 
mation to assist in selecting available services, 
and enables a receiver to automatically config- 
ure itself for the selected service. The method 
of information presentation is not specified, 
allowing receiver manufacturers to choose 
appropriate presentation methods. SI data is 
mostly carried as MPEG 2 Program Specific 
Information (PSI). 
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The structure of the SI data is similar to 
that PSIP structure for the ATSC standard. 
Various tables convey where information is 
located, and most tables may also contain one 
or more descriptors to convey additional data. 


Required Tables 

The Network Information Table (NIT) provides 
information about the physical network, 
including any grouping of transport streams 
and the relevant tuning information. It can be 
used during receiver set-up and the relevant 
tuning information stored in non-volatile mem- 
ory. The NIT can also be used to signal 
changes of tuning information. It is transmitted 
at least once every 10 seconds. 

The Service Description Table (SDT) 
describes the available services, such as the 
service names, the service providers, etc. It is 
transmitted at least once every 2 seconds (10 
seconds for other transport streams). 

There are multiple Event Information 
Tables (EIT), each of which describes the avail- 
able events or programs. The Present/Follow- 
ing Table is transmitted at least once every 2 
seconds (10 seconds for other transport 
streams). The Schedule Table for the first 8 
days is transmitted at least once every 10 sec- 
onds. The Schedule Table for further than 8 
days ahead is transmitted at least once every 
30 seconds. 

The Time and Date Table (TDT) contains 
the actual UTC-time coded as Modified Julian 
Date (MJD). Receivers can use it to maintain 
the correct local time. It is transmitted at least 
once every 30 seconds. 


Optional Tables 

The Bouquet Association Table (BAT) provides 
information regarding bouquets. Along with 
the name of the bouquet, it provides a list of 
services for each bouquet. A bouquet is a 
group of services that may traverse the net- 


work boundary. If present, it is transmitted at 
least once every 10 seconds. 

The Time Offset Table (TOT) is the same 
as the TDT, except it includes local time offset 
information. If present, it is transmitted at least 
once every 30 seconds. 

The Running Status Table (RST) updates 
the running status of one or more events. 
These are sent out only once, at the time of an 
event status change, unlike other tables which 
are usually transmitted repeatedly. 


Descriptors 

Much like MPEG 2 and ATSC, DVB uses 
descriptors to add new functionality. One or 
more of these descriptors may be included 
within most tables to extend data within the 
tables. A descriptor not recognized by a 
decoder must be ignored by that decoder. This 
enables new descriptors to be implemented 
without affecting receivers that cannot recog- 
nize and process the descriptors. 

The announcement support descriptor 
informs about announcements supported by 
the service. 

The bouquet name descriptor conveys the 
name of the bouquet the indicated services are 
allocated to, for example, “The News Chan- 
nel”, “The Sports Channel,” etc. 

The CA identifier descriptor identifies one 
or more conditional access systems which 
apply to the services. 

The cell frequency link descriptor describes 
the terrestrial network. It provides links 
between a cell and the frequencies that are 
used in this cell. 

The cell list descriptor lists the cells of the 
terrestrial network. 

The component descriptor specifies streams 
that are attached to an event. 

The content descriptor is used to classify 
the content of the event. 
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The country availability descriptor indi- 
cates if the bouquet or service is available in a 
specific country. 

The frequency list descriptor lists any addi- 
tional frequencies used for transmission. 

The linkage descriptor is used to link to 
another service or transport stream. 

The local time offset descriptor indicates the 
local time offset. This can be used for an auto- 
matic adjustment between regular and daylight 
savings time by a receiver. 

The multilingual bouquet name descriptor 
conveys the name of the bouquet in one or 
more languages. 

The multilingual component descriptor con- 
veys text describing an event in one or more 
languages. 

The multilingual network name descriptor 
conveys the name of the network in one or 
more languages. 

The multilingual service descriptor conveys 
the name of the service provider and service 
name in one or more languages. 

The network name descriptor conveys the 
name of a physical network—for example, 
“Alpha Broadcast”, “Beta Cable,” etc. 

The NVOD reference descriptor lists Near 
Video On Demand (NVOD) services. 

The parental rating descriptor provides a 
rating of the program based on age, content, 
language, or other criteria. 

The short event descriptor transmits the 
name and a short text description for an event. 
A language code also specifies the language 
used. 

The service descriptor contains the basic 
text identification of a service, such as service 
and provider name. 

The service list descriptor lists the services 
and service types for each transport stream 
that belongs to this bouquet or service. 


The service move descriptor enables a 
receiver to track a service that moves from one 
transport stream to another. 

The subtitling descriptor identifies subtitle 
data, including specifying the language and 
other information for the subtitle content. 

The teletext descriptor identifies teletext 
data, including specifying the language and 
“reference” pages, such as subtitle or index 
pages. 

The time shifted service descriptor identifies 
a service as a time-shifted copy of another ser- 
vice. 

The VBI descriptor indicates the transmis- 
sion of data intended to be transcoded into the 
VBI of NTSC or PAL video. Such data includes 
teletext, VPS, WSS, and closed captioning. 
Also, a generic format for transmitting lumi- 
nance-only data is defined to have a means of 
coping with other types of VBI data. 

The VBI teletext descriptor is used when 
conveying both VBI data and EBU teletext data 
using the same elementary stream. 


Terrestrial Transmission Format 


The COFDM (Coded Orthogonal Frequency 
Division Multiplexing) system is used for ter- 
restrial broadcast. Note that COFDM is not a 
modulation method; it is a multiplexing 
method. 

Frequency division multiplexing (the 
“FDM” in COFDM) means that the data is dis- 
tributed over many carriers, as opposed to 
using a single carrier. As a result, the data rate 
transmitted on each COFDM carrier is much 
lower than the data rate required of a single 
high-speed carrier. All the COFDM carriers 
are orthogonal (the “O” in COFDM), or mutu- 
ally perpendicular. Coded (the “C” in COFDM) 
means that forward error correction is incor- 
porated. 
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Standard modulation methods, such as 
QPSK, 16-QAM or 64-QAM, are used to modu- 
late the COFDM carriers. 

Two modes of operation are defined. A “2K 
mode” is available for single transmitter opera- 
tion and for small SFN (Single Frequency Net- 
work) networks with limited transmitter 
distances. An “8K mode” can be used for both 
single transmitter operation and small or large 
SFN networks. 

The system also allows different levels of 
QAM modulation and different inner code 
rates that can be used to trade off bit rate ver- 
sus ruggedness. 

Figure 14.2 illustrates a simplified block 
diagram of the COFDM process. 


COFDM Overview 

Basic OFDM spreads the transmitted data 
over thousands of carriers. The “2K mode” 
generates 2048 carriers, resulting in 1705 
active carriers remaining after removal of car- 
riers that might be subject to co-channel and 
adjacent-channel interference. The “8K mode” 
generates 8192 carriers, with 6817 active carri- 
ers. Data symbols on each carrier are 
arranged to occur simultaneously. The num- 
ber of bits carried by each modulation symbol 
is dependent on the choice of modulation: for 
example, 2 bits per symbol for QPSK, 4 bits per 
symbol for 16-QAM, etc. 

The carriers have a common, precise fre- 
quency spacing to ensure orthogonality of the 
carriers. Thus, the OFDM demodulator for 
one carrier doesn’t “see” the modulation of the 
others, reducing crosstalk. 

Another refinement is the guard interval to 
reduce inter-symbol and inter-carrier interfer- 
ence caused by echoes. Naturally, using a 
guard interval reduces the data capacity 
slightly. 


The process of simultaneously modulating 
or demodulating thousands of carriers is 
equivalent to Discrete Fourier Transform oper- 
ations, for which efficient Fast Fourier Trans- 
form (FFT) algorithms exist. Thus, the 
implementation is not as difficult as it might 
first appear. 


Adaptation 
Initially, the COFDM system must synchronize 


to the incoming MPEG 2 transport bitstream 
packets. Before any processing can be done, it 
must correctly identify the start and end points 
of each MPEG 2 packet by using its sync byte. 


Energy Dispersal 

In general, the COFDM signal must have a 
random, noise-like nature. This results in a flat, 
noise-like spectrum that maximizes channel 
bandwidth efficiency and reduces possible 
interference with other DTV and NTSC/PAL 
channels. 

To accomplish this, the data randomizer 
uses a pseudo-random number generator to 
scramble the bitstream. This receiver reverses 
this process to recover the original data. 


Outer Coder 

Like the ATSC standard, DVB uses Reed- 
Solomon (RS) encoding to provide forward 
error correction (FEC) for the data stream. 

The RS encoder takes the 188 bytes of an 
incoming MPEG 2 packet, calculates 16 parity 
bytes (known as Reed-Solomon parity bytes), 
and adds them to the end of the packet. This 
results in an error-protected packet length of 
204 bytes. 

The receiver compares the received 188 
bytes to the 16 parity bytes in order to deter- 
mine the validity of the recovered data. If 
errors are detected, the receiver can use the 
parity bytes to locate and correct the errors. 
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Figure 14.2. Simplified Block Diagram of a COFDM Modulator. 


Outer Interleave 

Convolutional byte-wise interleaving with a 
depth (1) of 12 is then applied to the error-pro- 
tected packets. The interleaved data bytes are 
then delimited by normal or inverted MPEG 2 
sync bytes. The periodicity of the 204 bytes is 
preserved. 


Inner Coder 

A range of punctured convolutional codes may 
be used, based on a mother convolutional code 
of rate one-half with 64 states. This part of the 
coding process can operate at many rates: 1/2, 
2/3,3/4, or 7/8. 


Inner Interleave 

This process consists of bit-wise interleaving 
followed by symbol interleaving. Both are 
block-based. 

The bit interleaver scrambles the sequen- 
tial order of the data stream to minimize sensi- 
tivity to burst-type interference. The bitstream 
is demultiplexed into [n] sub-streams, where n 
= 2 for QPSK, n = 4 for 16-QAM, and n = 6 for 


64-QAM. Each sub-stream is processed by a 
separate bit interleaver, with a bit interleaving 
block size of 126 bits. The block interleaving 
process is therefore repeated twelve times per 
OFDM symbol in the 2K mode and forty-eight 
times per symbol in the 8K mode. 

The symbol interleaver maps [n] bit words 
onto the 1512 (2K mode) or 6048 (8K mode) 
active carriers per OFDM symbol. In the 2K 
mode, 12 groups of 126 data words are used; 
for the 8K mode, 48 groups of 126 data words 
are used. 


Mapper 
All data carriers in one OFDM frame are either 


QPSK, 16-QAM, 64-QAM, non-uniform-16- 
QAM, or non-uniform-64-QAM using Gray 
mapping. 

The transmitted signal is then organized 
into frames, with each frame consisting of 68 
OFDM symbols. Four frames constitute a 
super-frame. Since the OFDM signal uses 
many separately modulated carriers, each 
symbol can be further divided into cells. 
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Frame Adaptation 
In addition to the audio, video, and ancillary 
data, an OFDM frame contains: 


Scattered pilot cells 
Continual pilot carriers 
TPS carriers 


The pilots are used for frame synchroniza- 
tion, frequency synchronization, time synchro- 
nization, channel estimation, transmission 
mode identification, and to follow phase noise. 

Various cells within a OFDM frame are 
also modulated with reference data whose 
value is known to the receiver. Cells containing 
this reference data are transmitted at an 
increased power level. 

The TPS (Transmission Parameter Signal- 
ling) carriers are used to convey information 
related to the transmission scheme, such as 
channel coding and modulation. The TPS is 
transmitted over 17 carriers for the 2K mode 
and over 68 carriers for the 8K mode (at the 
rate of 1 bit per carrier). The TPS carriers con- 
vey information on: 


Modulation including the value of the 
QAM constellation pattern 


Hierarchy information 
Guard interval 

Inner code rates 
Transmission mode 


Frame number in a super-frame 


Cable Transmission Format 


The QAM (Quadrature Amplitude Modula- 
tion) system is used for cable broadcast. 16- 
QAM, 32-QAM, 64-QAM, 128-QAM and 256- 
QAM are supported. Receivers must support 
at least 64-QAM modulation. 

The optional cable return path uses 
Quadrature Phase Shift Keying (QPSK) modu- 
lation in a 200 kHz, 1 MHz, or 2 MHz channel 
to provide a return path of up to about 3 Mbps. 
The path to the user may be either in-band 
(embedded in the MPEG-2 transport stream in 
the DVB-C channel) or out-of-band (on a sepa- 
rate 1 or 2 MHz frequency band). 


Satellite Transmission Format 


The QPSK (Quaternary Phase Shift Keying) 
modulation system is used for satellite broad- 
cast. 
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chapter 15 cr 
CDROM Contents 


The included CDROMs contain files to assist 
in testing and evaluating various video sub- 
systems. 


Still Directory 


These files contain still images at various reso- 
lutions that can be loaded into a frame buffer, 
NTSC or PAL encoded, and displayed on a TV 
or monitor. They allow the evaluation of the 
video subsystem, including the NTSC/PAL 
encoder. Figures 15.1 through 15.23 show the 
test images. 

These TIFF files contain either 24-bit lin- 
ear RGB data with a range of 0-255 or 24-bit 
gamma-corrected RGB data with a range of 
16-235. 

Note the color bars do not contain the -7.5 
IRE (NTSC) or -2 IRE (PAL) PLUGE signal 
due to the limitations of using the RGB color 
space. The I and Q values are also slightly 
incorrect due to the limitations of using the 
RGB color space. 

The filenames are listed in Table 15.1. 


Usage of Still Test Images 


100% Color Bars. This test signal is used to 
measure the hue and color saturation accu- 
racy. The transitions between colors are deter- 
mined by the quality of any filters in the signal 
path. The sharper the transitions are without 
ringing, the better the filters. Since this pattern 
is computer-generated, there may be flicker 
along horizontal edges when viewed on a TV 
or an interlaced monitor. Unless the video sig- 
nal is RF modulated, it should pass through 
the system with no problems. If it is RF modu- 
lated, the yellow and cyan colors will be 
affected. 


75% Color Bars. This test signal is also used to 
measure the hue and color saturation accu- 
racy. Again, the transitions between colors are 
affected by the quality of any filters in the sig- 
nal path. The sharper the transitions are with- 
out ringing, the better the filters. Since this 
pattern is computer-generated, there may be 
flicker along horizontal edges when viewed on 
a TV or an interlaced monitor. Even if the video 
signal is RF modulated, it should pass through 
the system with no problems. 
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Resolution 
isesle 640 x 320 x 240 x 720 x 352 x Rane 
480 240 180 480 240 

100% color bars 2000 2100 2200 2300 2400 0-255 
75% color bars 2001 2101 2201 2301 2401 0-255 
100% color wheel 2002 2102 2202 2302 2402 0-255 
75% red frame 2003 2103 2203 2303 2403 0-255 
linear Y ramp 2004 2104 2204 2304 2404 0-255 
75% Y bars 2005 2105 2205 2305 2405 0-255 
10-step luminance staircase 2006 2106 2206 2306 2406 0-255 
needle 2007 2107 2207 2307 2407 0-255 
color test 2008 2108 2208 2308 2408 0-255 
linear yellow ramp 2009 2109 2209 2309 2409 0-255 
overscan measurement 2010 2110 - 2310 2410 0-255 
text samples 2011 2111 2211 2311 2411 0-255 
100% fast edges 2012 2112 2212 2312 2412 0-255 
100% anti-aliased edges 2013 2113 2213 2313 2413 0-255 
75% fast edges 2014 2114 2214 2314 2414 0-255 
75% anti-aliased edges 2015 2115 2215 2315 2415 0-255 
88% fast edges 2016 2116 2216 2316 2416 0-255 
88% anti-aliased edges 2017 2117 2217 2317 2417 0-255 
classroom picture 2018 2118 2218 2318 2418 0-255 
anti-aliased classroom picture 2019 2119 2219 2319 2419 0-255 
100% color bars - - - 2320 - 16-235 
75% color bars - - - 2321 - 16-235 
multiburst - - - 2322 - 16-235 


Table 15.1a. File Names for 24-bit RGB Still Image Test Files. 
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Resolution 
Teche 768 x 384 x 720 x 352 x Ranie 
576 288 576 288 

100% color bars 3000 3100 3300 3400 0-255 
75% color bars 3001 3101 3301 3401 0-255 
100% color wheel 3002 3102 3302 3402 0-255 
75% red frame 3003 3103 3303 3403 0-255 
linear Y ramp 3004 3104 3304 3404 0-255 
75% Y bars 3005 3105 3305 3405 0-255 
10-step luminance staircase 3006 3106 3306 3406 0-255 
needle 3007 3107 3307 3407 0-255 
color test 3008 3108 3308 3408 0-255 
linear yellow ramp 3009 3109 3309 3409 0-255 
overscan measurement 3010 3110 3310 3410 0-255 
text samples 3011 3111 3311 3411 0-255 
100% fast edges 3012 3112 3312 3412 0-255 
100% anti-aliased edges 3013 3113 3313 3413 0-255 
75% fast edges 3014 3114 3314 3414 0-255 
75% anti-aliased edges 3015 3115 3315 3415 0-255 
88% fast edges 3016 3116 3316 3416 0-255 
88% anti-aliased edges 3017 3117 3317 3417 0-255 
classroom picture 3018 3118 3318 3418 0-255 
anti-aliased classroom picture 3019 3119 3319 3419 0-255 
100% color bars - - 3320 - 16-235 
75% color bars - - 3321 - 16-235 
multiburst - - 3322 - 16-235 


Table 15.1b. File Names for 24-bit RGB Still Image Test Files. 
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Figure 15.1. Test Signal xx00. 


Figure 15.2. Test Signal xx01. 
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Figure 15.3. Test Signal xx02. 


Figure 15.4. Test Signal xx03. 
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Figure 15.5. Test Signal xx04. 


Figure 15.6. Test Signal xx05. 
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Figure 15.7. Test Signal xx06. 


Figure 15.8. Test Signal xx07. 
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Figure 15.9. Test Signal xx08. 


Figure 15.10. Test Signal xx09. 
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DPD >>> >> DDD >I>D DDD DDD DDD DDD DDD 


Figure 15.11. Test Signal xx10. 


ABCDEFGHIJKLMNOPQRSTUYWKYZ 


ABCDEFGHIJKLMNOPQRSTUYWKYZ 


ABCDEFGHIJKLMNOPQRSTUYWKYZ 


ABCDEFGHIJKLMNOPQRSTUVWKXYZ 


ABCDEFGHIJKLMNOPORSTUVWXYZ 


ABCDEFGHIJKLMNOPQRSTUVWXYZ 


Figure 15.12. Test Signal xx11. 
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Figure 15.13. Test Signal xx12. 


Figure 15.14. Test Signal xx13. 
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Figure 15.15. Test Signal xx14. 


Figure 15.16. Test Signal xx15. 
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Figure 15.17. Test Signal xx16. 


Figure 15.18. Test Signal xx17. 


CDROM Contents 677 


Figure 15.19. Test Signal xx18. 


Figure 15.20. Test Signal xx19. 
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Figure 15.21. Test Signal xx20. 


Figure 15.22. Test Signal xx21. 
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Figure 15.23. Test Signal xx22. 


100% Color Wheel. This nonstandard test sig- 
nal may be used to visually see how well the 
system handles highly saturated colors. Unless 
the video signal is RF modulated, it should 
pass through the system with no problems. 


75% Red Frame. This test signal is used to visu- 
ally look for static noise present in the system. 
Even if it is RF modulated, it should pass 
through the system with no problems. 


Linear Y Ramp. This test signal is used to test 
luma nonlinearity and the linearity of any A/D 
and D/A converters present in the system. 
Even if it is RF modulated, it should pass 
through the system with no problems. 


75% Y Bars. This test signal is also used to mea- 
sure luma nonlinearity and to perform monitor 
adjustment. Since this pattern is computer-gen- 
erated, there may be flicker along horizontal 
edges when viewed on a TV or an interlaced 
monitor. Even if it is RF modulated, it should 
pass through the system with no problems. 


10-step Y Staircase. This test signal is also used 
to measure luma nonlinearity. Even if it is RF 
modulated, it should pass through the system 
with no problems. 


Needle. This test signal allows determining the 
frequency response of the system. Even if the 
video signal is RF modulated, it should pass 
through the system with no problems. 
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Color Test. This test signal contains a multi- 
burst pattern to help determine the frequency 
response of the system. It also allows the eval- 
uation of color reproduction and fine detail. 
Even if the video signal is RF modulated, it 
should pass through the system with no prob- 
lems. 


Linear Yellow Ramp. This nonstandard test sig- 
nal may be used to assist with determining the 
ability of the system to handle the full range of 
video levels without clipping or compression. 
Unless the video signal is RF modulated, it 
should pass through the system with no prob- 
lems. 


Overscan Measurement. This test signal may 
be used to visually determine the amount of 
overscan a TV or monitor has. Even if the 
video signal is RF modulated, it should pass 
through the system with no problems. 


Text Samples. This nonstandard test signal con- 
tains both normal and anti-aliased text of sev- 
eral sizes, and illustrates the need for “fat, 
wide” text for video. Notice that the anti-aliased 
text is difficult to read on a TV or interlaced 
monitor due to its being too thin. Even if the 
video signal is RF modulated, it should pass 
through the system with no problems. 


100% Fast Edges. This nonstandard test signal 
may be used to determine how any filters in 
the system respond to fast edges of highly sat- 
urated (100%) colors. With good filters, there 
should be no ringing along the color edges. 
Use of sharp cut-off filters results in ringing. 
Since this pattern is computer-generated, 
there may be flicker along horizontal edges 
when viewed on a TV or an interlaced monitor. 
Unless the video signal is RF modulated, it 
should pass through the system with no prob- 
lems. 


100% Anti-Aliased Edges. This nonstandard 
test signal is a Gaussian-filtered version of the 
“100 Fast Edges,” and illustrates how using 
slow edges removes flicker in interlaced dis- 
plays and any ringing on color edges due to fil- 
tering. Unless the video signal is RF 
modulated, it should pass through the system 
with no problems. 


75% Fast Edges. This nonstandard test signal 
may be used to determine how any filters in 
the system respond to fast edges of normal 
(75%) colors. With good filters, there should be 
no ringing along the color edges. Use of sharp 
cut-off filters results in ringing. Since this pat- 
tern is computer-generated, there may be 
flicker along horizontal edges when viewed on 
a TV or an interlaced monitor. Even if the video 
signal is RF modulated, it should pass through 
the system with no problems. 


75% Anti-Aliased Edges. This nonstandard test 
signal is a Gaussian-filtered version of the “75% 
Fast Edges,” and illustrates how using slow 
edges removes flicker in interlaced displays 
and any ringing on color edges due to filtering. 
Even if the video signal is RF modulated, it 
should pass through the system with no prob- 
lems. 


88% Fast Edges. This nonstandard test signal 
may be used to determine how any filters in 
the system respond to fast edges of highly sat- 
urated (88%) colors. The colors are the equiva- 
lent of 75% linear RGB values, common in 
computer graphics. With good filters, there 
should be no ringing along the color edges. 
Since this pattern is computer-generated, 
there may be flicker along horizontal edges 
when viewed on a TV or an interlaced monitor. 
Unless the video signal is RF modulated, it 
should pass through the system with no prob- 
lems. 
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88% Anti-Aliased Edges. This nonstandard test 
signal is a Gaussian-filtered version of the “88% 
Fast Edges,” and illustrates how using slow 
edges removes flicker in interlaced displays 
and any ringing on color edges due to filtering. 
Unless the video signal is RF modulated, it 
should pass through the system with no prob- 
lems. 


Classroom Picture. This nonstandard test sig- 
nal may be used to determine how the system 
responds to very fine detail, various sharp 
edges, subtle shading, and highly saturated 
colors. There should be no ringing along the 
color edges and little loss of detail. There may 
be some flicker when viewed on a TV or an 
interlaced monitor due to the fine detail and 
sharp edges present. Unless the video signal is 
RF modulated, it should pass through the sys- 
tem with no problems. 


Anti-Aliased Classroom Picture. This nonstand- 
ard test signal is a Gaussian-filtered version of 
“Classroom Picture,” and illustrates how using 
slow edges removes flicker in interlaced dis- 
plays. There should be very little loss of detail 
and no ringing on color edges. Unless the 
video signal is RF modulated, it should pass 
through the system with no problems. 


H261 Directory 


This directory contains source code for a 
H.261 software encoder and decoder. Also 
included are sample video bitstreams. 


H263 Directory 


This directory contains source code for a 
H.263 software encoder and decoder. Also 
included are sample video bitstreams. 


MPEG_1 Directory 


This directory contains source code for a 
MPEG 1 video encoder, video decoder, system 
bitstream multiplexer, and system bitstream 
demultiplexer. Also included are sample video 
and system bitstreams. 


MPEG_2 Directory 


This directory contains source code for a 
MPEG 2 video encoder and video decoder. 
Also included are sample video and system bit- 
streams. 


Sequence Directory 


Various image sequences at various resolu- 
tions are provided as test source material for 
H.261, H.263, MPEG 1 and MPEG 2 encoders. 
This directory is split over the two CDROMs 
due to space limitations. 


Jitter Directory 


All file types are 24-bit gamma-corrected RGB 
TIFF (the RGB range is 0-255) and are 10 
frames long. They must be converted to the 
proper YCbCr format for driving a H.261, 
H.263, MPEG 1, or MPEG 2 encoder. Two res- 
olutions are represented: 352 x 240 and 352 x 
288. For each resolution, three versions of 
sequence images are provided: 0 ns, 5 ns, and 
10 ns. 
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Usage 

The sequence of images in the “0 ns” directory 
may be used to evaluate the performance of a 
video compression system assuming an ideal 
NTSC/PAL decoder (no jitter) and no video 
noise (ideal source). 

The sequence of images in the “5 ns” direc- 
tory may be used to evaluate the performance 
of a video compression system assuming a stu- 
dio-quality NTSC/PAL decoder (5 ns of pixel 
clock jitter) and little video noise (up to +38 
codes out of 255). 

The sequence of images in the “10 ns” 
directory may be used to evaluate the perfor- 
mance of a video compression system assum- 
ing a consumer-quality NTSC/PAL decoder 
(10 ns of pixel clock jitter) and typical video 
noise (up to +6 codes out of 255). 


By comparing the compression ratios of 
the three image sequences, the quality of the 
temporal processing may be evaluated. With 
no temporal preprocessing, the “0 ns” and “5 
ns” sequences should yield similar compres- 
sion ratios. The “10 ns” sequence will have a 
poorer compression ratio. With good temporal 
preprocessing, all three sequences should 
yield similar compression ratios. This demon- 
strates that with better NTSC/PAL decoders, 
or by adding temporal preprocessing, higher 
compression ratios can be easily achieved. 
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8-VSB 
AC-3 


AC’97, AC’98 


AC Coupled 


Active Video 


A/D, ADC 


AFC 


AGC 


Alpha 


Glossary 


See Vestigial Sideband. 
An early name for Dolby Digital. 


These are definitions by Intel for the audio I/O implementation for PCs. Two 
chips are defined: an analog audio I/O chip and a digital controller chip. The 
digital chip will eventually be replaced by a software solution. The goal is to 
increase the audio performance of PCs and lower cost. 


AC coupling passes a signal through a capacitor to remove any DC offset, or 
the overall voltage level that the video signal “rides” on. One way to find the 
signal is to remove the DC offset by AC coupling, and then do DC restoration 
to add a known DC offset (one that we selected). Another reason AC coupling 
is important is that it can remove large (and harmful) DC offsets. 


The part of the video waveform that contains picture information. Most of the 
active video, if not all of it, is visible on the display. 


Analog-to-Digital Converter. This device is used to digitize audio and video. 
An ADC for digitizing video must be capable of sampling at 10 to 150 million 
samples per second (MSPS). 

See Automatic Frequency Control. 


See Automatic Gain Control. 


See Alpha Channel and Alpha Mix. 
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Alpha 
Channel 


Alpha Mix 


AM 


Amplitude 
Modulation 


Ancillary 
Timecode 


Anti-Alias 
Filter 


Aperture Delay 


Aperture 
Jitter 


Artifacts 


The alpha channel is used to specify an alpha value for each video sample. The 
alpha value is used to control the blending, on a sample-by-sample basis, of 
two images. 


new sample = (alpha) (sample A color) + (1 — alpha) (sample B color) 


Alpha typically has a normalized value of 0 to 1. In a computer environment, 
the alpha values can be stored in additional memory. When you hear about 32- 
bit frame buffers, what this really means is that there are 24 bits of color, 8 
each for red, green, and blue, along with an 8-bit alpha channel. Also see 
Alpha Mix. 


This is a way of combining two images. How the mixing is performed is speci- 
fied by the alpha channel. The little box that appears over the left-hand shoul- 
der of a news anchor is put there by an alpha mixer. Wherever the little box is 
to appear, a “1” is put in the alpha channel. Wherever it doesn't appear, a “0” is 
used. When the alpha mixer sees a “1” coming from the alpha channel, it dis- 
plays the little box. Whenever it sees a “0,” it displays the news anchor. Of 
course, it doesn’t matter if a “1” or a “0” is used, but you get the point. 


See Amplitude Modulation. 


A method of encoding data onto a carrier, such that the amplitude of the car- 
rier is proportional to the data value. 


ITU-R BT.1366 defines how to transfer VITC and LTC as ancillary data in digi- 
tal component interfaces. 


A lowpass filter used to bandwidth-limit a signal to less than one-half the sam- 
pling rate. 


Aperture delay is the time from an edge of the input clock of the ADC until the 
time the ADC actually takes the sample. The smaller this number, the better. 


The uncertainty in the aperture delay. This means the aperture delay time 
changes a little bit each time, and that little bit of change is the aperture jitter. 


In the video domain, artifacts are blemishes, noise, snow, spots, whatever. 
When you have an image artifact, something is wrong with the picture from a 
visual standpoint. Don't confuse this term with not having the display properly 
adjusted. For example, if the hue control is set wrong, the picture will look 
bad, but this is not an artifact. An artifact is some physical disruption of the 
image. 
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Aspect Ratio 


Asynchronous 
ATC 


ATSC 


ATSC A/49 
ATSC A/52 


ATSC A/53, 
ATSC A/54 


ATSC A/57 
ATSC A/63 
ATSC A/65 


Audio 
Modulation 


Audio 
Subcarrier 


Automatic 
Frequency 
Control (AFC) 


Automatic 
Gain 


Control (AGC) 


Back Porch 


The ratio of the width of the picture to the height. Displays commonly have a 
4:3 or 16:9 aspect ratio. Program material may have other aspect ratios (such 
as 20:9), resulting in it being “letterboxed” on the display. 

Refers to circuitry without a common clock or timing signal. 

See Ancillary Timecode. 

Advanced Television Systems Committee. They defined the SDTV and HDTV 
standards for the United States, using MPEG 2 for video and Dolby Digital for 
audio. Other countries are also adopting the ATSC HDTV standard. 

Defines the ghost cancellation reference signal for NTSC. 


Defines the Dolby Digital audio compression for ATSC HDTV. 


Defines ATSC HDTV for the United States. 


Defines the program, episode, and version ID for ATSC HDTV. 
Defines the method for handling 25 and 50 Hz video for ATSC HDTV. 
Defines the program and system information protocol for ATSC HDTV. 


Refers to modifying an audio subcarrier with audio information so that it may 
be mixed with the video information and transmitted. 


A specific frequency that is modulated with audio data. 


A technique to lock onto and track a desired frequency. 


A circuit that has a constant output amplitude, regardless of the input ampli- 
tude. 


The portion of the video waveform between the trailing edge of the horizontal 
sync and the start of active video. 
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Bandpass 
Filter 


Bandwidth 
(BW) 
Bandwidth 
Segmented 
Orthogonal 
Frequency 
Division 
Multiplexing 


Baseband 


BBC 


BITC 


Black Burst 


Black Level 


Blanking 


A circuit that allows only a selected range of frequencies to pass through. 


The range of frequencies a circuit will respond to or pass through. It may also 
be the difference between the highest and lowest frequencies of a signal. 


BST-OFDM attempts to improve on COFDM by modulating some OFDM car- 
riers differently from others within the same multiplex. A given transmission 
channel may therefore be “segmented,” with different segments being modu- 
lated differently. 


When applied to audio and video, baseband means an audio or video signal 
that is not modulated onto another carrier (such as RF modulated to channel 3 
or 4 for example). In DTV, baseband also may refer to the basic (unmodu- 
lated) MPEG 2 program or system stream. 


British Broadcasting Corporation. 


Burned-In Time Code. The timecode information is displayed within a portion 
of the picture, and may be viewed on any monitor or TV. 


Black burst is a composite video signal with a totally black picture. It is used to 
synchronize together video equipment so the video outputs are aligned. Black 
burst tells the video equipment the vertical sync, horizontal sync, and the 
chroma burst timing. 


This level represents the darkest an image can get, defining what black is for a 
particular video system. If for some reason the video goes below this level, it is 
referred to as blacker-than-black. You could say that sync is blacker-than- 
black. 


On a CRT display, the scan line moves from the left edge to the right edge, 
jumps back to the left edge, and starts out all over again, on down the screen. 
When the scan line hits the right side and is about to be brought back to the 
left side, the video signal is blanked so that you can’t “see” the return path of 
the scan beam from the right to the left-hand edge. To blank the video signal, 
the video level is brought down to the blanking level, which is below the black 
level if a pedestal is used. 
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Blanking Level 


Blooming 


Breezeway 


Brightness 


BS.707 


BST-OFDM 


BT.470 


BT.601 


BT.653 


BT.656 


BT.709 


That level of the video waveform defined by the system to be where blanking 
occurs. This could be the black level if a pedestal is not used or below the 
black level if a pedestal is used. 


This is an effect, sometimes caused when video becomes whiter-than-white, in 
which a line that is supposed to be nice and thin becomes fat and fuzzy on the 
screen. 


That portion of the video waveform between the trailing edge of horizontal 
sync and the start of color burst. 


This refers to how much light is emitted by the display, and is controlled by 
the intensity of the video level. 


This ITU recommendation specifies the stereo audio specifications (Zweiton 
and NICAM 728) for the PAL and SECAM video standards. 


Short for Bandwidth Segmented Orthogonal Frequency Division Multiplex- 
ing. 


This ITU recommendation specifies the various NTSC, PAL, and SECAM 
video standards used around the world. SMPTE 170M also specifies the (M) 
NTSC video standard used in the United States. BT.470 has replaced BT.624. 


This ITU recommendation specifies the 720 x 480 (59.94 Hz), 960 x 480 (59.94 
Hz), 720 x 576 (50 Hz), and 960 x 576 (50 Hz) 4:2:2 YCbCr interlaced stan- 
dards. 


This ITU recommendation defines the various teletext standards used around 
the world. Systems A, B, C, and D for both 525-line and 625-line TV systems 
are defined. 


This ITU recommendation defines a parallel interface (8-bit or 10-bit, 27 MHz) 
and a serial interface (270 Mbps) for the transmission of 4:3 BT.601 4:2:2 
YCbCr digital video between pro-video equipment. Also see SMPTE 125M. 


This ITU recommendation specifies the 1920 x 1080 RGB and 4:2:2 YCbCr 
interlaced and progressive 16:9 digital video standards. Frame refresh rates of 
60, 59.94, 50, 30, 29.97, 25, 24, and 23.976 Hz are supported. 
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BT.799 


BT.1119 


BT.1124 


BT.1197 


BT.1302 


BT.1303 


BT.1304 


BT.1305 


BT.1358 


BT.1362 


BT.1364 


BT.1365 


This ITU recommendation defines the transmission of 4:3 BT.601 4:4:4:4 YCb- 
CrK and RGBK digital video between pro-video equipment. Two parallel inter- 
faces (8-bit or 10-bit, 27 MHz) or two serial interfaces (270 Mbps) are used. 


This ITU recommendation defines the widescreen signaling (WSS) informa- 
tion for NTSC and PAL video signals. For (B, D, G, H, I) PAL systems, WSS 
may be present on line 23, and on lines 22 and 285 for (M) NTSC. 


This ITU recommendation defines the ghost cancellation reference (GCR) 
signal for NTSC and PAL. 


This ITU recommendation defines the PALplus standard, allowing the trans- 
mission of 16:9 programs over normal PAL transmission systems. 


This ITU recommendation defines the transmission of 16:9 BT.601 4:2:2 
YCbCr digital video between pro-video equipment. It defines a parallel inter- 
face (8-bit or 10-bit, 36 MHz) and a serial interface (860 Mbps). 


This ITU recommendation defines the transmission of 16:9 BT.601 4:4:4:4 
YCbCrK and RGBK digital video between pro-video equipment. Two parallel 
interfaces (8-bit or 10-bit, 36 MHz) or two serial interfaces (8360 Mbps) are 
used. 


This ITU recommendation specifies the checksum for error detection and sta- 
tus for pro-video digital interfaces. 


This ITU recommendation specifies the digital audio format for ancillary data 
for pro-video digital interfaces. Also see SMPTE 272M. 


This ITU recommendation defines the 720 x 480 (59.94 Hz) and 720 x 576 (50 
Hz) 4:2:2 YCbCr pro-video progressive standards. Also see SMPTE 293M. 


This ITU recommendation defines the pro-video serial interface for the trans- 
mission of BT.1358 digital video between equipment. Two 270 Mbps serial 
interfaces are used. 


This ITU recommendation specifies the ancillary data packet format for pro- 
video digital interfaces. Also see SMPTE 291M. 


This ITU recommendation specifies the 24-bit digital audio format for pro- 
video HDTV serial interfaces. Also see SMPTE 299M. 
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BT.1366 


BTSC 


Burst 


Burst Gate 


Carrier 
CATV 
CBC 


CCIR 


Chaoji 
VideoCD 


Checksum 


Chroma 


Chroma 
Bandpass 


This ITU recommendation specifies the transmission of timecode as ancillary 
data for pro-video digital interfaces. Also see SMPTE 266M. 


A technique of implementing stereo audio for NTSC video. One FM subcar- 
rier transmits a L+R signal, and an AM subcarrier transmits a L-R signal. 


See Color Burst. 


This is a signal that tells a video decoder where the color burst is located 
within the scan line. 


The blue-minus-luma signal, also called a color difference signal. When added 
to the luma (Y) signal, it produces the blue video signal. 


A frequency that is modulated with data to be transmitted. 
Community antenna television, now generally meaning cable TV. 
Canadian Broadcasting Corporation. 


Comite Consultatif International des Radiocommunications or International 
Radio Consultative Committee. The CCIR no longer exists—it has been 
absorbed into the parent body, the ITU. For a given “CCIR xxx” specification, 
see “BT.xxx.” 


Another name for Super VideoCD. 


An error-detecting scheme which is the sum of the data values transmitted. 
The receiver computes the sum of the received data values and compares it to 
the transmitted sum. If they are equal, the transmission was error-free. 


The NTSC, PAL, or SECAM video signal contains two parts that make up what 
you see on the display: the intensity part, and the color part. Chroma is the 
color part. 


In a NTSC or PAL video signal, the luma (black and white) and the chroma 
(color) information are combined together. If you want to decode an NTSC or 
PAL video signal, the luma and chroma must be separated. A chroma band- 
pass filter removes the luma from the video signal, leaving the chroma rela- 
tively intact. This works reasonably well except in images where the luma and 
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Chroma Burst 


Chroma 
Demodulator 


Chroma Key 


Chroma Trap 


chroma information overlap, meaning that we have luma and chroma stuff at 
the same frequency. The filter can’t tell the difference between the two and 
passes everything. This can make for a funny-looking picture. Next time 
you’re watching TV and someone is wearing a herringbone jacket or a shirt 
with thin, closely spaced stripes, take a good look. You may see a rainbow 
color effect moving through that area. What’s happening is that the video 
decoder thinks that the luma is chroma. Since the luma isn't chroma, the 
video decoder can’t figure out what color it is and it shows up as a rainbow pat- 
tern. This problem can be overcome by using a comb filter. 


See Color Burst. 


After the NTSC or PAL video signal makes its way through the Y/C separator, 
the colors must be decoded. That’s what a chroma demodulator does. It takes 
the chroma output of the Y/C separator and recovers two color difference sig- 
nals (typically I and Q or U and V). Now, with the luma information and two 
color difference signals, the video system can figure out what colors to dis- 


play. 


This is a method of combining two video images. An example of chroma key- 
ing in action is the nightly news person standing in front of a giant weather 
map. In actuality, the person is standing in front of a blue or green background 
and their image is mixed with a computer-generated weather map. This is how 
it works: a TV camera is pointed at the person and fed along with the image of 
the weather map into a box. Inside the box, a decision is made. Wherever it 
sees the blue or green background, it displays the weather map. Otherwise, it 
shows the person. So, whenever the person moves around, the box figures out 
where he is, and displays the appropriate image. 


In a NTSC or PAL video signal, the luma (black and white) and the chroma 
(color) information are combined together. If you want to decode the video 
signal, the luma and chroma must be separated. The chroma trap is one 
method for separating the chroma from the luma, leaving the luma relatively 
intact. How does it work? The NTSC or PAL signal is fed to a trap filter. For all 
practical purposes, a trap filter allows certain frequencies to pass through, but 
not others. The trap filter is designed with a response to remove the chroma 
so that the output of the filter only contains the luma. Since this trap stops 
chroma, it’s called a chroma trap. The sad part about all of this is that not only 
does the filter remove chroma, it removes luma as well if it exists within the 
frequencies where the trap exists. The filter only knows ranges and, depend- 
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Chrominance 


CIF 


Clamp 


Clipping Logic 


Closed 
Captioning 


ing on the image, the luma information may overlap the chroma information. 
The filter can’t tell the difference between the luma and chroma, so it traps 
both when they are in the same range. What’s the big deal? Well, you lose 
luma and this means that the picture is degraded somewhat. Using a comb fil- 
ter for a Y/C separator is better than a chroma trap or chroma bandpass. 


In video, the terms chrominance and chroma are commonly (and incorrectly) 
interchanged. See the definition of Chroma. 


Common Interface Format or Common Image Format. The Common Inter- 
face Format was developed to support video conferencing. It has an active res- 
olution of 352 x 288 and a refresh rate of 29.97 frames per second. The High- 
Definition Common Image Format (HD-CIF) is used for HDTV production 
and distribution, having an active resolution of 1920 x 1080 with a frame 
refresh rate of 23.976, 24, 29.97, 30, 50, 59.94, or 60 Hz. 


This is basically another name for the DC-restoration circuit. It can also refer 
to a switch used within the DC-restoration circuit. When it means DC restora- 
tion, then it’s usually used as “clamping.” When it’s the switch, then it’s just 
“clamp.” 


A circuit used to prevent illegal conversion. Some colors can exist in one color 
space but not in another. Right after the conversion from one color space to 
another, a color space converter might check for illegal colors. If any appear, 
the clipping logic is used to limit, or clip, part of the information until a legal 
color can be represented. Since this circuit clips off some information and is 
built using logic, it's not too hard to see how the name “clipping logic” was 
developed. 


A service which decodes text information transmitted with the video signal 
and displays it on the display. For NTSC, the caption signal may be present on 
lines 21 and 284. For PAL, the caption signal may be present on lines 22 and 
334. See the EIA-608 specification for (M) NTSC usage of closed captioning 
and the EJA-708 specification for DTV support. 

For digital transmissions such as HDTV and SDTV, the closed caption 
characters are multiplexed as a separate stream along with the video and 
audio data. It is common practice to actually embed this stream in the MPEG 
video bitstream itself, rather than at the transport layer. Unfortunately there is 
no current standard for this caption stream—each system (DSS, DVB, ATSC, 
DVD) has its own solution. 
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The practical place in MPEG 2 to put caption data is in the user_data field, 
which can be placed at various places within the video stream. For DVD, cap- 
tion data may be 8-bit user_data in the group_of_pictures header (525/60 sys- 
tems), a digitized caption signal (quantized to 16 levels) that is processed as 
normal video data (625/50 systems), or a subpicture that is simply decoded 
and mixed with the decoded video. For ATSC broadcasts, the &bit data is 
inserted in the user_data field of individual picture headers (up to 60 times per 
second). 


See subtitles. 


This is a color space primarily used in color printing. CMYK is an acronym for 
Cyan, Magenta, Yellow, and blacK. The CMYK color space is subtractive, 
meaning that cyan, magenta, yellow and black pigments or inks are applied to 
a white surface to remove color information from the white surface to create 
the final color. The reason black is used is because even if a printer could print 
hues of cyan, magenta, and yellow inks perfectly enough to make black 
(which it can’t for large areas), it would be too expensive since colored inks 
cost more than black inks. So, when black is used, instead of putting down a 
lot of CMY, they just use black. 


Coded orthogonal frequency division multiplexing, or COFDM, transmits dig- 
ital data differently than 8-VSB or other single-carrier approaches. Frequency 
division multiplexing means that the data to be transmitted is distributed over 
many carriers (1705 or 6817 for DVB-T), as opposed to modulating a single 
carrier. Thus, the data rate on each COFDM carrier is much lower than that 
required of a single carrier. The COFDM carriers are orthogonal, or mutually 
perpendicular, and forward error correction (“coded”) is used. 

COFDM is a multiplexing technique rather than a modulation technique. 
One of any of the common modulation methods, such as QPSK, 16-QAM or 
64-QAM, is used to modulate the COFDM carriers. 


See Coded Orthogonal Frequency Division Multiplexing. 
This is a test pattern used to check whether a video system is calibrated cor- 


rectly. A video system is calibrated correctly if the colors are the correct 
brightness, hue, and saturation. This can be checked with a vectorscope. 
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A waveform of a specific frequency and amplitude that is positioned between 
the trailing edge of horizontal sync and the start of active video. The color 
burst tells the color decoder how to decode the color information contained in 
that line of active video. By looking at the color burst, the decoder can deter- 
mine what’s blue, orange, or magenta. Essentially, the decoder figures out 
what the correct color is. 


See Chroma Demodulator. 


See Chroma Demodulator. 


All of the color spaces used in color video require three components. These 
might be RGB, YIQ, YUV or Y(R-Y) (B-Y). In the Y(R-Y) (B-Y) color space, 
the R-Y and B-Y components are often referred to as color difference signals 
for obvious reasons. They are made by subtracting the luma (Y) from the red 
and blue components. I and Q and U and V are also color difference signals 
since they are scaled versions of R-Y and B-Y. All the Ys in each of the YIQ, 
YUV and Y(R-Y) (B-Y) are basically the same, although they are slightly dif- 
ferent between SDTV and HDTV. 


Extraneous colors that appear along the edges of objects, but don’t have a 
color relationship to those areas. 


The color encoder does the exact opposite of the color decoder. It takes two 
color difference signals, such as I and Q or U and V, and combines them into a 
chroma signal. 


This is essentially the same thing as Chroma Key. 


A color killer is a circuit that shuts off the color decoding if the incoming video 
does not contain color information. How does this work? The color killer looks 
for the color burst and if it can’t find it, it shuts off the color decoding. For 
example, let’s say that a color TV is going to receive material recorded in 
black and white. Since the black and white signal does not contain a color 
burst, the color decoding is shut off. Why is a color killer used? Well, in the 
old days, the color decoder would still generate a tiny little bit of color if a 
black and white transmission was received, due to small errors in the color 
decoder, causing a black and white program to have faint color spots through- 
out the picture. 
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See Color Encoder. 


This term is used to describe how close a color is to the theoretical. For exam- 
ple, in the Y'UV color space, color purity is specified as a percentage of satura- 
tion and +q, where q is an angle in degrees, and both quantities are referenced 
to the color of interest. The smaller the numbers, the closer the actual color is 
to the color that it’s really supposed to be. For a studio-grade device, the satu- 
ration is +2% and the hue is +2 degrees. On a vectorscope, if you’re in that 
range, you’re studio quality. 


A color space is a mathematical representation for a color. No matter what 
color space is used—RGB, YIQ, YUV, etc.—orange is still orange. What 
changes is how you represent orange. For example, the RGB color space is 
based on a Cartesian coordinate system and the HSI color space is based on a 
polar coordinate system. 


The name Toshiba uses for the analog YPbPr video interface on their con- 
sumer equipment. If the interface supports progressive SDTV resolutions, it is 
called ColorStream Pro. If the interface supports HDTV resolutions, it is called 
ColorStream HD. 


The color subcarrier is a signal used to control the color encoder or color 
decoder. For (M) NTSC, the frequency of the color subcarrier is about 3.58 
MHz and for (B, D, G, H, D PAL it’s about 4.43 MHz. In the color encoder, a 
portion of the color subcarrier is used to create the color burst, while in the 
color decoder, the color burst is used to reconstruct a color subcarrier. 


Color temperature is measured in degrees Kelvin. If a TV has a color tempera- 
ture of 8,000 degrees Kelvin, that means the whites have the same shade as a 
piece of pure carbon heated to that temperature. Low color temperatures have 
a shift towards red; high color temperatures have a shift towards blue. 

The standard for video is 6,500 degrees Kelvin. Thus, professional TV 
monitors use a 6,500-degree color temperature. However, most consumer TVs 
have a color temperature of 8,000 degrees Kelvin or higher, resulting in a blu- 
ish cast. By adjusting the color temperature of the TV, more accurate colors 
are produced, at the expense of picture brightness. 


This is another method of performing Y/C separation. A comb filter is used in 
place of a chroma bandpass or chroma trap. The comb filter provides better 
video quality since it does a better job of separating the luma from chroma. It 
reduces the amount of creepy-crawlies or zipper artifacts. It’s called a comb fil- 
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ter because the frequency response looks like a comb. The important thing to 
remember is that the comb filter is a better method for Y/C separation than 
chroma bandpass or chroma trap. 


See CIF. 


See CIF. 


Video using three separate color components, such as YCbCr or RGB. 


A single video signal that contains brightness, color, and timing information. If 
a video system is to receive video correctly, it must have several pieces of the 
puzzle in place. It must have the picture that is to be displayed on the screen, 

and it must be displayed with the correct colors. This piece is called the active 
video. The video system also needs information that tells it where to put each 

pixel. This is called sync. The display needs to know when to shut off the elec- 
tron beam so the viewer can't see the spot retrace across the CRT display. 
This piece of the video puzzle is called blanking. Now, each piece could be 
sent in parallel over three separate connections, and it would still be called 
video and would still look good on the screen. This is a waste, though, 
because all three pieces can be combined together so that only one connec- 
tion is needed. Composite video is a video stream that combines all of the 
pieces required for displaying an image into one signal, thus requiring only 
one connection. NTSC and PAL are examples of composite video. Both are 
made up of active video, horizontal sync, horizontal blanking, vertical sync, 
vertical blanking, and color burst. RGB is not an example of composite video, 

even though each red, green, and blue signal may contain sync and blanking 
information, because all three signals are required to display the picture with 
the right colors. 


Compression ratio is a number used to tell how much information is squeezed 
out of an image when it has been compressed. For example, suppose we start 
with a 1 MB image and compress it down to 128 kB. The compression ratio 
would be: 


1,048,576 / 131,072 = 8 
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This represents a compression ratio of 8:1; 1/8 of the original amount of 
storage is now required. For a given compression technique—MPEG, for 
example—the higher the compression ratio, the worse the image looks. This 
has nothing to do with which compression method is better, for example 
JPEG vs. MPEG. Rather, it depends on the application. A video stream that is 
compressed using MPEG at 100:1 may look better than the same video stream 
compressed to 100:1 using JPEG. 


This is an image artifact caused by not having enough bits to represent the 
image. The reason the effect is called “contouring” is because the image 
develops vertical bands of brightness. 


A video term referring to how far the whitest whites are from the blackest 
blacks in a video waveform. If the peak white is far away from the peak black, 
the image is said to have high contrast. With high contrast, the image is very 
stark and very “contrasty,” like a black-and-white tile floor. If the two are very 
close to each other, the image is said to have poor, or low, contrast. With poor 
contrast, an image may be referred to as being “washed out”—you can’t tell 
the difference between white and black, and the image looks gray. 


Yes, this is a real video term! Creepy-crawlies refers to a specific image artifact 
that is a result of the NTSC system. When the nightly news is on, and a little 
box containing a picture appears over the anchorperson’s shoulder, or when 
some computer-generated text shows up on top of the video clip being shown, 
get up close to the TV and check it out. Along the edges of the box, or along 
the edges of the text, you'll notice some jaggies “rolling” up (or down) the pic- 
ture. That’s the creepy-crawlies. Some people refer to this as zipper because it 
looks like one. 


This occurs when the video decoder incorrectly interprets high-frequency 
luma information (brightness) to be chroma information (color), resulting in 


color being displayed where it shouldn’t. 


This occurs when the video decoder incorrectly interprets chroma informa- 
tion (color) to be high-frequency luma information (brightness). 


A condition when one signal erroneously modulates another signal. 


Interference from one signal that is detected on another. 
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These are short for Digital-to-Analog Converter. 


Abbreviation for decibels, a standard unit for expressing relative power, volt- 
age, or current. 


Measure of power in communications. 0 dBm = 1 mW, with a logarithmic rela- 
tionship as the values increase or decrease. In a 50-ohm system, 0 dBm = 
0.223 volts. 


Decibels referenced to 1 watt. 


DC restoration is what you have to do to a video signal after it has been AC- 
coupled and has to be digitized. Since the video waveform has been AC-cou- 
pled, we no longer know absolutely where it is. For example, is the bottom of 
the sync tip at -5v or at 1v? In fact, not only don’t we know where it is, it also 
changes over time, since the average voltage level of the active video changes 
over time. Since the ADC requires a known input level and range to work 
properly, the video signal needs to be referenced to a known DC level. DC res- 
toration essentially adds a known DC level to an AC-coupled signal. In decod- 
ing video, the DC level used for DC restoration is usually such that when the 
sync tip is digitized, it will be generate the number 0. 


This is short for Discrete Cosine Transform, used in the MPEG, H.261, and 
H.263 video compression algorithms. 


One-tenth of a Bel, used to define the ratio of two powers, voltages, or cur- 
rents, in terms of gains or losses. It is 10x the log of the power ratio and 20x 
the voltage or current ratio. 


When a video signal is digitized so that 100 samples are produced, but only 
every other one is stored or used, the signal is decimated by a factor of 2:1. 
The image is now 1/4 of its original size, since 3/4 of the data is missing. If 
only one out of five samples were used, then the image would be decimated by 
a factor of 5:1, and the image would be 1/25 its original size. Decimation, then, 
is a quick-and-easy method for image scaling. 

Decimation can be performed in several ways. One way is the method just 
described, where data is literally thrown away. Even though this technique is 
easy to implement and cheap, it introduces aliasing artifacts. Another method 
is to use a decimation filter, which reduces the aliasing artifacts, but is more 
costly to implement. 
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A decimation filter is a lowpass filter designed to provide decimation without 
the aliasing artifacts associated with simply throwing data away. 


Also referred to as post-emphasis and post-equalization. De-emphasis per- 
forms a frequency-response characteristic that is complementary to that intro- 
duced by pre-emphasis. 


A circuit used to restore a frequency response to its original form. 


The process of recovering an original signal from a modulated carrier. 


In NTSC and PAL video, demodulation is the technique used to recover the 
color difference signals. See the definitions for Chroma Demodulator and 
Color Decoder; these are two other names for the demodulator used in 
NTSC/PAL video applications. Demodulation is also used after DTV tuners to 
convert the transmitted DTV signal to a baseband MPEG 2 stream. 


Differential gain is how much the color saturation changes when the luma 
level changes (it isn’t supposed to). For a video system, the better the differen- 
tial gain—that is, the smaller the number specified—the better the system is 
at figuring out the correct color. 


Differential phase is how much the hue changes when the luma level changes 
(it isn’t supposed to). For a video system, the better the differential phase— 
that is, the smaller the number specified—the better the system is at figuring 
out the correct color. 


Digital 8 compresses video using standard DV compression, but records it in a 
manner that allows it to use standard Hi-8 tape. The result is a DV “box” that 
can also play standard Hi-8 and 8 mm tapes. On playback, analog tapes are 
converted to a 25 Mbps compressed signal available via the i-Link digital out- 
put interface. Playback from analog tapes has limited video quality. New 
recordings are digital and identical in performance to DV; audio specs and 
other data also are the same. 


Digital video using three separate color components, such as YCbCr or RGB. 


Digital video that is essentially the digitized waveform of NTSC or PAL video 
signals, with specific digital values assigned to the sync, blank, and white lev- 
els. 
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An encryption method (also known as “5C”) developed by Sony, Hitachi, Intel, 
Matsushita and Toshiba for IEEE 1394 interfaces. 


Digital VCRs are similar to analog VCRs in that tape is still used for storage. 
Instead of recording an analog audio/video signal, digital VCRs record digital 
signals, usually using compressed audio/video. 


See DVD-Video and DVD-Audio. 


DVITC digitizes the analog VITC waveform to generate 8-bit values. This 
allows the VITC to be used with digital video systems. For 525-line video sys- 
tems, it is defined by SMPTE 266M. BT.1366 defines how to transfer VITC 
and LTC as ancillary data in digital component interfaces. 


A DCTis just another way to represent an image. Instead of looking at it in the 
time domain—which, by the way, is how we normally do it—it is viewed in the 
frequency domain. It’s analogous to color spaces, where the color is still the 
color but is represented differently. Same thing applies here—the image is 
still the image, but it is represented in a different way. 

Why do JPEG, MPEG, H.261, and H.263 base part of their compression 
schemes on the DCT? Because it is more efficient to represent an image that 
way. In the same way that the YCbCr color space is more efficient than RGB 
in representing an image, the DCT is even more efficient at image representa- 
tion. 


A discrete time oscillator is a digital version of the voltage-controlled oscilla- 
tor. 
An audio compression technique developed by Dolby. It is a multi-channel 


surround sound format used in DVD and HDTV. 


The distance between screen pixels measured in millimeters. The smaller the 
number, the better the horizontal resolution. 
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As the name implies, you are using two buffers—for video, this means two 
frame buffers. While buffer 1 is being read, buffer 2 is being written to. When 
finished, buffer 2 is read out while buffer 1 is being written to. 


A circuit used to change a high-frequency signal to a lower frequency. 
The frequency satellites use to transmit data to Earth stations. 


This method is identical to the sync suppression technique for scrambling 
analog TV channels, except there is no suppression of the horizontal blanking 
intervals. Sync pulse suppression only takes place during the vertical blanking 
interval. The descrambling pulses still go out for the horizontal blanking inter- 
vals (to fool unauthorized descrambling devices). If a descrambling device is 
triggering on descrambling pulses only, and does not know that the scrambler 
is using the drop field scrambling technique, it will try to reinsert the horizon- 
tal intervals (which were never suppressed). This is known as double reinser- 
tion, which causes compression of the active video signal. An unauthorized 
descrambling device creates a washed-out picture and loss of neutral sync 
during drop field scrambling. 


Short for Digital Transmission Content Protection. 


DTS stands for Digital Theater Systems. It is a multi-channel surround sound 
format, similar to Dolby Digital. For DVDs that use DTS audio, the DVD- 
Video specification still requires that PCM or Dolby Digital audio still be 
present. In this situation, only two channels of Dolby Digital audio may be 
present (due to bandwidth limitations). 


Short for digital television, including SDTV, EDTV, and HDTV. 


Short for digital video broadcast, a method of transmitting digital audio and 
video (SDTV or HDTV resolution), based on MPEG 2. There are several vari- 
ations: DVB-T for terrestrial broadcasting (ETSI EN 300 744), DVB-S for satel- 
lite broadcasting (ETSI EN 300 421), and DVB-C for cable broadcasting (ETSI 
EN 300 429). Both MPEG and Dolby Digital compressed audio are supported. 

DVB-S uses the QPSK modulation system to guard against errors in satel- 
lite transmissions caused by reduced signal-to-noise ratio, with channel cod- 
ing optimized to the error characteristics of the channel. A typical set of 
parameter values and 36 MHz transponder gives a useful data rate of around 
38 Mbps. 
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DVB-C uses Quadrature Amplitude Modulation (QAM), which is opti- 
mized for maximum data rate since the cable environment is less prone to 
interference than satellite or terrestrial. Systems from 16-QAM up to 256- 
QAM can be used, but the system centers on 64-QAM, in which an 8 MHz 
channel can accommodate a physical payload of about 38 Mbps. The cable 
return path uses Quadrature Phase Shift Keying (QPSK) modulation in a 200 
kHz, 1 MHz, or 2 MHz channel to provide a return path of up to about 3 Mbps. 
The path to the user may be either in-band (embedded in the MPEG-2 Trans- 
port Stream in the DVB-C channel) or out-of-band (on a separate 1 or 2 MHz 
frequency band). 


DVDs that contain linear PCM audio data in any combination of 44.1, 48.0, 
88.2, 96.0, 176.4, or 192 kHz sample rates, 16, 20, or 24 bits per sample, and 1 
to 6 channels, subject to a maximum bit rate of 9.6 Mbps. With a 176.4 or 192 
kHz sample rate, only two channels are allowed. 

Meridian Lossless Packing (MLP) is a lossless compression method that 
has an approximate 2:1 compression ratio. The use of MLP is optional, but the 
decoding capability is mandatory on all DVD-Audio players. 

Dolby Digital compressed audio is required for any video portion of a 
DVD-Audio disc. 


DVDs that contain about two hours of digital audio, video, and data. The video 
is compressed and stored using MPEG 2. The audio is either linear PCM or 
Dolby Digital compressed audio. DTS compressed audio may also be used as 
an option. 

Linear PCM audio can be sampled at 48 or 96 kHz, 16, 20, or 24 bits per 
sample, and 1 to 8 channels. The maximum bit rate is 6.144 Mbps, which lim- 
its sample rates and bit sizes in some cases. 

For Dolby Digital audio, the bit rate is 64 to 448 kbps, with 384 kbps being 
the normal rate for 5.1 channels and 192 kbps being the normal rate for ste- 
reo. The channel combinations are (front/surround): 1/0, 1+1/0 (dual mono), 
2/0, 3/0, 2/1, 3/1, 2/2, and 3/2. The LFE channel (0.1) is optional with all 8 
combinations. 

For DTS audio, the bit rate is 64 to 1,536 kbps. The channel combinations 
are (front/surround): 1/0, 2/0, 3/0, 2/1, 2/2, 3/2. The LFE channel (0.1) is 
optional with all 6 combinations. 


Abbreviation for Digital Visual Interface. This is a digital video interface to a 
display, designed to replace the analog YPbPr or RGB interface. For analog 
displays, the D/A conversion resides in the display. 
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See Digital Vertical Interval Timecode. 


The weakest to the strongest signal a circuit will accept as input or generate as 
an output. 


See Enhanced Definition Television. 

Electronics Industries Alliance. 

United States teletext standard, also called NABTS. 

United States closed captioning and extended data services (XDS) standard. 


NTSC Copy Generation Management System—Analog (CGMS-A). This adds 
copy protection capabilities to NTSC video by extending the EIA-608 standard 
to control the Macrovision anti-copy process. 


United States DTV closed captioning standard. 


NTSC “v-chip” operation. This adds content filtering capabilities to NTSC 
video by extending the EIA-608 standard. 


Specifies how to convert QAM to 8-VSB, with support for OSD (on screen dis- 
plays). 


Specifies how to convert QAM to 8-VSB, with no support for OSD (on screen 
displays). 


United States HDTV content advisory standard. 


This specification consists of three parts (EJA-770.1, EJA-770.2, and EIA- 
770.3). EIA-770.1 and EIA-770.2 define the analog YPbPr video interface for 
525-line interlaced and progressive SDTV systems. EJA-770.3 defines the ana- 
log YPbPr video interface for interlaced and progressive HDTV systems. 


This standard defines a specification for a baseband digital interface to a DTV 
using IEEE 1394 and provides a level of functionality that is similar to the ana- 
log system. It is designed to enable interoperability between a DTV and vari- 
ous types of consumer digital audio/video sources, including settop boxes 
and digital VCRs. 


This EIA-J recommendation specifies another widescreen signaling (WSS) 
standard for NTSC video signals. WSS may be present on 20 and 283. 
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Enhanced EDTYV is a television capable of displaying at least 480 progressive active scan 

Definition lines. No aspect ratio is specified. 

Television For the ATSC system, typical EDTV (luminance) resolutions and refresh 
rates are: 


Refresh Rate 
Active Aspect (p = progressive, i = interlaced) 
Recon Ratio | 53.976p | 29.97p | 59.94p 
24p 30p 60p 
720 x 360 16:9 x x x 
640 x 480 4:3 x x x 
720 x 480 4:3 x x x 


For the DVB system, typical EDTV (luminance) resolutions and refresh 
rates are: 


Refresh Rate 
Active ikapact (p = progressive, i = interlaced) 
Lene Ratio | 53.976p asp | 22-277 | sop | 59.94 
24p 30p 60p 

720 x 432 16:9 x x x 

352 x 576 4:3 x x x 

480 x 576 4:3 x x x 

544 x 576 4:3 x x x 

720 x 576 4:3 x x x 

720 x 360 16:9 x x x 
352 x 480 4:3 x x x 
480 x 480 4:3 x x x 
544 x 480 4:3 x x x 
640 x 480 4:3 x x x 
720 x 480 4:3 x x x 
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These are two groups of pulses, one that occurs before the serrated vertical 
sync and another group that occurs after. These pulses happen at twice the 
normal horizontal scan rate. They exist to ensure correct 2:1 interlacing in 
early televisions. 

This specification defines NICAM 728 digital audio for PAL. 

This specification defines information sent during the vertical blanking inter- 


val using PAL teletext (ETSI ETS 300 706) to control VCRs in Europe (PDC). 


Defines the widescreen signaling (WSS) information for PAL video signals. 
For (B, D, G, H, I) PAL systems, WSS may be present on line 23. 


This is the DVB-S specification. 


This is the DVB-C specification. 


This is the enhanced PAL teletext specification. 


This specification defines data transmission using PAL teletext (ETSI ETS 300 
706). 


Defines the PALplus standard, allowing the transmission of 16:9 programs 
over normal PAL transmission systems. 


Defines the ghost cancellation reference (GCR) signal for PAL. 


This is the DVB subtitling specification. 


This is the DVB-T specification. 


Fading is a method of switching from one video source to another. Next time 
you watch a TV program (or a movie), pay extra attention when the scene is 
about to end and go on to another. The scene fades to black, then a fade from 
black to another scene occurs. Fading between scenes without going to black 
is called a dissolve. One way to do a fade is to use an Alpha Mixer. 
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An interlaced display is made using two fields, each one containing one-half of 
the scan lines needed to make up one frame of video. Each field is displayed in 
its entirety—therefore, the odd field is displayed, then the even, then the odd, 
and so on. Fields only exist for interlaced scanning systems. So for (M) NTSC, 
which has 525 lines per frame, a field has 262.5 lines, and two fields make up a 
525-line frame. 


When Apple Computer initially developed IEEE 1394, they called it Firewire. 


Flicker occurs when the frame rate of the video is too low. It’s the same effect 
produced by an old fluorescent light fixture. The two problems with flicker are 
that it’s distracting and tiring to the eyes. 


See Frequency Modulation. 


A frame of video is essentially one picture or “still” out of a video stream. By 
playing these individual frames fast enough, it looks like people are “moving” 
on the screen. It’s the same principle as flip cards, cartoons, and movies. 


A frame buffer is a memory used to hold an image for display. How much 
memory are we talking about? Well, let’s assume a horizontal resolution of 640 
pixels and 480 scan lines, and we'll use the RGB color space. This works out to 
be: 


640 x 480 x 3 = 921,600 bytes or 900 kB 
So, 900 kB are needed to store one frame of video at that resolution. 
The frame rate of a video source is how fast a new still image is available. For 
example, with the NTSC system, the entire display is repainted about once 
every 30th of a second, for a frame rate of about 30 frames per second. For 
PAL, the frame rate is 25 frames per second. For computer displays, the frame 


rate is usually about 75 frames per second. 


Frame rate conversion is the act of converting one frame rate to another. 


This technique sends data as frequency variations of a carrier signal. 


This is the area of the video waveform that sits between the start of horizontal 
blanking and the start of horizontal sync. 
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Correction 


GCR 


Genlock 


Ghost 
Cancellation 
Reference 


Gray Scale 


H.261, H.263 


The transfer characteristics of most cameras and displays are nonlinear. For a 
display, a small change in amplitude when the signal level is small produces a 
change in the display brightness level, but the same change in amplitude at a 
high level will not produce the same magnitude of brightness change. This 
nonlinearity is known as gamma. 


Before being displayed, linear RGB data must be processed (gamma cor- 
rected) to compensate for the nonlinearity of the display. 


See Ghost Cancellation Reference Signal. 


A video signal provides all of the information necessary for a video decoder to 
reconstruct the picture. This includes brightness, color, and timing informa- 
tion. To properly decode the video signal, the video decoder must lock to all 
the timing information embedded within the video signal, including the color 
burst, horizontal sync, and vertical sync. The decoder looks at the color burst 
of the video signal and reconstructs the original color subcarrier that was used 
by the encoder. This is needed to properly decode the color information. It 
also generates a sample clock (done by looking at the sync information within 
the video signal), used to clock pixel data out of the decoder into a memory or 
another circuit for processing. The circuitry within the decoder that does all of 
this work is called the genlock circuit. Although it sounds simple, the genlock 
circuit must be able to handle very bad video sources, such as the output of 
VCRs, cameras, and toys. In reality, the genlock circuit is the most complex 
section of a video decoder. 


A reference signal on (M) NTSC scan lines 19 and 282 and (B, D, G, H, I) PAL 
scan line 318 that allows the removal of ghosting from TVs. Filtering is 
employed to process the transmitted GCR signal and determine how to filter 
the entire video signal to remove the ghosting. ITU-R BT.1124 and ETSI ETS 
300 732 define the standard each country uses. ATSC A/49 also defines the 
standard for NTSC. 


The term gray scale has several meanings. It some cases, it means the luma 
component of color video signals. In other cases, it means a black-and-white 
video signal. 


The ITU-T H.261 and H.263 video compression standards were developed to 
implement video conferencing over ISDN, LANs, regular phone lines, etc. 
H.261 supports video resolutions of 352 x 288 and 176 x 144 at up to 29.97 
frames per second. H.263 supports video resolutions of 1408 x 1152, 704 x 576, 
352 x 288, 176 x 144, and 128 x 96 at up to 29.97 frames per second. 
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HD-CIF 
HD-SDTI 
HDTV 


(High 
Definition TV) 


High Definition 
Television 


Highpass 
Filter 


See CIF. 
High Data-Rate Serial Data Transport Interface, defined by SMPTE 348M. 


Short for High Definition Television. HDTV is capable of displaying at least 
720 progressive or 1080 interlaced active scan lines. It must be capable of dis- 
playing a 16:9 image using at least 540 progressive or 810 interlaced active 
scan lines. 

For the ATSC system, typical HDTV (uminance) resolutions and refresh 
rates are: 


Refresh Rate 
(p = progressive, i = interlaced) 
Active Aspect 
Resolution Ratio | 23.976p| 29.97p | 59.941 | 59.94p 
24p 30p 60i 60p 
1280 x 720 16:9 x x x 
1920 x 1080 16:9 x x x 


For the DVB system, typical HDTV (luminance) resolutions and refresh 
rates are: 


Refresh Rate 
(p = progressive, i = interlaced) 
Active Aspect 
Resolution Ratio | 23.976p 25 29.97p 50i 59.941 | 59.94p 
24p P 30p 50p 60i 60p 
1280 x 720 16:9 x x x x x x 
1440 x 1080 16:9 x x x x x x 
1920 x 1080 16:9 x x x x x x 
See HDTV. 


A circuit that passes frequencies above a specific frequency (the cutoff fre- 
quency). Frequencies below the cutoff frequency are reduced in amplitude to 
eliminate them. 
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Rate 
Horizontal 
Sync 

House Sync 


HSI 


HSL 


HSV 
HSYNC 


Hue 


During the horizontal blanking interval, the video signal is at the blank level 
so as not to display the electron beam when it sweeps back from the right to 
the left side of the CRT screen. 


See Resolution. 


This is how fast the scanning beam in a display sweeps from side to side. In 
the NTSC system, this rate is 63.556 ms, or 15.734 kHz. That means the scan- 
ning beam moves from side to side 15,734 times a second. 


This is the portion of the video signal that tells the display where to place the 
image in the left-to-right dimension. The horizontal sync pulse tells the receiv- 
ing system where the beginning of the new scan line is. 


This is another name for black burst. 


HSI stands for Hue, Saturation and Intensity. HSI is based on polar coordi- 
nates, while the RGB color space is based on a three-dimensional Cartesian 
coordinate system. The intensity, analogous to luma, is the vertical axis of the 
polar system. The hue is the angle and the saturation is the distance out from 
the axis. HSI is more intuitive to manipulate colors as opposed to the RGB 
space. For example, in the HSI space, if you want to change red to pink, you 
decrease the saturation. In the RGB space, what would you do? My point 
exactly. In the HSI space, if you wanted to change the color from purple to 
green, you would adjust the hue. Take a guess what you would have to do in 
the RGB space. However, the key thing to remember, as with all color spaces, 
is that it’s just a way to represent a color—nothing more, nothing less. 


This is similar to HSI, except that HSL stands for Hue, Saturation and Light- 
ness. 


This is similar to HSI, except that HSV stands for Hue, Saturation and Value. 
Check out the Horizontal Sync definition. 


In technical terms, hue refers to the wavelength of the color. That means that 
hue is the term used for the base color—red, green, yellow, etc. Hue is com- 
pletely separate from the intensity or the saturation of the color. For example, 
a red hue could look brown at low saturation, bright red at a higher level of 
saturation, or pink at a high brightness level. All three “colors” have the same 
hue. 


Glossary 709 


Huffman 
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IEC 60461 


IEC 60958 


IEC 61834 


IEC 61880 


IEC 61883 


IEC 62107 


IEEE 1394 


iLink 


Huffman coding is a method of data compression. It doesn’t matter what the 
data is—it could be image data, audio data, or whatever. It just so happens that 
Huffman coding is one of the techniques used in JPEG, MPEG, H.261, and 
H.263 to help with the compression. This is how it works. First, take a look at 
the data that needs to be compressed and create a table that lists how many 
times each piece of unique data occurs. Now assign a very small code word to 
the piece of data that occurs most frequently. The next largest code word is 
assigned to the piece of data that occurs next most frequently. This continues 
until all of the unique pieces of data are assigned unique code words of vary- 
ing lengths. The idea is that data that occurs most frequently is assigned a 
small code word, and data that rarely occurs is assigned a long code word, 
resulting in space savings. 


See Improved Definition Television. 


Defines the longitudinal (LTC) and vertical interval (VITC) timecode for 
NTSC and PAL video systems. LTC requires an entire field time to transfer 
timecode information, using a separate track. VITC uses one scan line each 
field during the vertical blanking interval. Also see SMPTE 12M. 


Defines a serial digital audio interface for consumer (SPDIF) and professional 
applications. 


Defines the DV (originally the “Blue Book”) standard. Also see SMPTE 314M. 


Defines the widescreen signaling (WSS) information for NTSC video signals. 
WSS may be present on lines 20 and 283. 


Defines the methods for transferring data, audio, DV (IEC 61834), and MPEG 
2 data over IEEE 1394. 


Defines the Super VideoCD standard. 

A high-speed “daisy-chained” serial interface. Digital audio, video, and data 
can be transferred with either a guaranteed bandwidth or a guaranteed 
latency. It is hot-pluggable, and uses a small 6-pin or 4-pin connector, with the 


6-pin connector providing power. 


Sony's name for their IEEE 1394 interface. 
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Illegal Video 


Improved 
Definition 
Television 


Intensity 


Interlaced 


Interpolation 


IRE Unit 


ITU-R BT.xxx 


Jitter 


JPEG 


Some colors that exist in the RGB color space can’t be represented in the 
NTSC and PAL video domain. For example, 100% saturated red in the RGB 
space (which is the red color on full strength and the blue and green colors 
turned off) can’t exist in the NTSC video signal, due to color bandwidth limita- 
tions. The NTSC encoder must be able to determine that an illegal color is 
being generated and stop that from occurring, since it may cause over-satura- 
tion and blooming. 


IDTV is different from HDTV. IDTV is a system that improves the display on 
TVs by adding processing in the TV; standard NTSC or PAL signals are trans- 
mitted. 


This is the same thing as Brightness. 


An interlaced video system is one where two interleaved fields are used to 
generate one video frame. Therefore, the number of lines in a field is one-half 
of the number of lines in a frame. In NTSC, there are 262.5 lines per field (525 
lines per frame), while there are 312.5 lines per field (625 lines per frame) in 
PAL. Each field is drawn on the screen consecutively—first one field, then the 
other. 


Interpolation is a mathematical way of generating additional information. Let’s 
say that an image needs to be scaled up by a factor of two, from 100 samples to 
200 samples. The “missing” samples are generated by calculating (interpolat- 
ing) new samples between two existing samples. After all of the “missing” 
samples have been generated—presto!—200 samples exist where only 100 
existed before, and the image is twice as big as it used to be. 


An arbitrary unit used to describe the amplitude characteristics of a video sig- 
nal. White is defined to be 100 IRE and the blanking level is defined to be 0 
IRE. 


See BT.xxx. 


Short-term variations in the characteristics (such as frequency, amplitude, 
etc.) of a signal. 


JPEG stands for Joint Photographic Experts Group. However, what people 
usually mean when they use the term “JPEG” is the image compression stan- 
dard they developed. JPEG was developed to compress still images, such as 
photographs, a single video frame, something scanned into the computer, and 
so forth. You can run JPEG at any speed that the application requires. For a 
still picture database, the algorithm doesn’t have to be very fast. If you run 
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kbps 
kBps 


Line-Locked 
Clock 


Line Store 


Linearity 


Locked 


Longitudinal 
Timecode 


Lossless 


Lossy 


JPEG fast enough, you can compress motion video—which means that JPEG 
would have to run at 50 or 60 fields per second. This is called motion JPEG or 
M-JPEG. You might want to do this if you were designing a video editing sys- 
tem. Now, M-JPEG running at 60 fields per second is not as efficient as MPEG 
2 running at 60 fields per second because MPEG was designed to take advan- 
tage of certain aspects of motion video. 


Abbreviation for kilobits per second. 
Abbreviation for kilobytes per second. 


A design that ensures that there is always a constant number of samples per 
scan line, even if the timing of the line changes. 


A line store is amemory used to hold one scan line of video. If the horizontal 
resolution of the active display is 640 samples and RGB is used as the color 
space, the line store would have to be 640 locations long by 3 bytes wide. This 
amounts to one location for each sample and each color. Line stores are typi- 
cally used in filtering algorithms. For example, a comb filter is made up of one 
or more line stores. 


Linearity is a basic measurement of how well an ADC or DAC is performing. 
Linearity is typically measured by making the ADC or DAC attempt to gener- 
ate a linearly increasing signal. The actual output is compared to the ideal of 
the output. The difference is a measure of the linearity. The smaller the num- 
ber, the better. Linearity is typically specified as a range or percentage of LSBs 
(Least Significant Bits). 


When a PLL is accurately producing timing that is precisely lined up with the 
timing of the incoming video source, the PLL is said to be “locked.” When a 
PLL is locked, the PLL is stable and there is minimum jitter in the generated 
sample clock. 


Timecode information is stored on a separate track from the video, requiring 
an entire field time to store or read it. 


Lossless is a term used with compression. Lossless compression is when the 
decompressed data is exactly the same as the original data. It’s lossless 
because you haven’t lost anything. 


Lossy compression is the exact opposite of lossless. The regenerated data is 
different from the original data. The differences may or may not be noticeable, 
but if the two images are not identical, the compression was lossy. 
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Mbps 
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MESECAM 


M-JPEG 


Modulator 


Moiré 


Monochrome 


A circuit that passes frequencies below a specific frequency (the cutoff fre- 
quency). Frequencies above the cutoff frequency are reduced in amplitude to 
eliminate them. 


See Longitudinal Timecode. 


As mentioned in the definition of chroma, the NTSC and PAL video systems 
use a signal that has two pieces: the black and white part, and the color part. 
The black and white part is the luma. It was the luma component that allowed 
color TV broadcasts to be received by black and white TVs and still remain 
viewable. 


In video, the terms luminance and luma are commonly (and incorrectly) inter- 
changed. See the definition of Luma. 


Abbreviation for megabits per second. 
Abbreviation for megabytes per second. 


A technique of recording SECAM video. Instead of dividing the FM color sub- 
carrier by four and then multiplying back up on playback, MESECAM uses 
the same heterodyne conversion as PAL. 


See Motion JPEG. 


A modulator is basically a circuit that combines two different signals in such a 
way that they can be pulled apart later. What does this have to do with video? 
Let’s take the NTSC system as an example, although the example applies 
equally as well to PAL. The NTSC system may use the YIQ or YUV color 
space, with the I and Q or U and V signals containing all of the color informa- 
tion for the picture. Two 3.58-MHz color subcarriers (90 degrees out of phase) 
are modulated by the I and Q or U and V components and added together to 
create the chroma part of the NTSC video. 


This is a type of image artifact. A moire effect occurs when a pattern is created 
on the display where there really shouldn't be one. A moire pattern is typically 
generated when two different frequencies beat together to create a new, 
unwanted frequency. 


A monochrome signal is a video source having only one component. Although 
usually meant to be the luma (or black-and-white) video signal, the red video 
signal coming into the back of a computer display is monochrome because it 
only has one component. 
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Motion 
JPEG 


MPEG 


MPEG 1 


MPEG 2 
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MPEG 4 


This is a term that is used to describe ADCs and DACs. An ADC or DAC is 
said to be monotonic if for every increase in input signal, the output increases. 
Any ADC or DAC that is nonmonotonic—meaning that the output decreases 
for an increase in input—is bad! Nobody wants a nonmonotonic ADC or DAC. 


Motion estimation is trying to figure out where an object has moved to from 
one video frame to the other. Why would you want to do that? Well, let’s take 
an example of a video source showing a ball flying through the air. The back- 
ground is a solid color that is different from the color of the ball. In one video 
frame the ball is at one location and in the next video frame the ball has moved 
up and to the right by some amount. Now let’s assume that the video camera 
has just sent the first video frame of the series. Now, instead of sending the 
second frame, wouldn’t it be more efficient to send only the position of the 
ball? Nothing else moves, so only two little numbers would have to be sent. 
This is the essence of motion estimation. By the way, motion estimation is an 
integral part of MPEG, H.261, and H.263. 


JPEG compression or decompression that is applied real-time to video. Each 
field or frame of video is individually processed. 


MPEG stands for Moving Picture Experts Group. This is an ISO/IEC (Inter- 
national Standards Organization) body that is developing various compression 
algorithms. MPEG differs from JPEG in that MPEG takes advantage of the 
redundancy on a frame-to-frame basis of a motion video sequence, whereas 
JPEG does not. 


MPEG 1 was the first MPEG standard defining the compression format for 
real-time audio and video. The video resolution is typically 352 x 240 or 352 x 
288, although higher resolutions are supported. The maximum bit rate is 
about 1.5 Mbps. MPEG 1 is used for the Video CD format. 


MPEG 2 extends the MPEG 1 standard to cover a wider range of applications. 
Higher video resolutions are supported to allow for HDTV applications, and 
both progressive and interlaced video are supported. MPEG 2 is used for the 
DVD-Video and SVCD formats, and also forms the basis for digital SDTV and 
HDTV. 


MPEG 3 was originally targeted for HDTV applications. This was incorporated 
into MPEG 2, so there is no MPEG 3 standard. 


MPEG 4 uses an object-based approach, where scenes are modeled as compo- 
sitions of objects, both natural and synthetic, with which the user may inter- 
act. Visual objects in a scene are described mathematically and given a 
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MPEG 7 


MTS 


NABTS 


NexTView 


NICAM 728 


Noninterlaced 


NTSC 


position in a two- or three-dimensional space. Similarly, audio objects are 
placed in a sound space. Thus, the video or audio object need only be defined 
once; the viewer can change his viewing position, and the calculations to 
update the audio and video are done locally. Classical “rectangular” video, as 
from a camera, is one of the visual objects defined in the standard. In addition, 
there is the ability to map images onto computer-generated shapes, and a text- 
to-speech interface. 


MPEG 7 standardizes the description of multimedia material (referred to as 
metadata), such as still pictures, audio, and video, regardless if locally stored, 
in a remote database, or broadcast. Examples are finding a scene in a movie, 
finding a song in a database, or selecting a broadcast channel. The searcher 
for an image can use a sketch or a general description. Music can be found 
using a “query by humming” format. 


Multichannel Television Sound. A generic name for various stereo audio 
implementations, such as BTSC and Zweiton. 


North American Broadcast Teletext Specification (EIA-516). This is also ITU- 
R BT.653 525-line system C teletext. However, the NABTS specification goes 
into much more detail. 


An electronic program guide (EPG) based on ETSI ETS 300 707. 


A technique of implementing digital stereo audio for PAL video using another 
audio subcarrier. The bit rate is 728 kbps. It is discussed in BS.707 and ETSI 
ETS 300 163. NICAM 728 is also used to transmit non-audio digital data in 
China. 


This is a method of scanning out a video display that is the total opposite of 
interlaced. All of the lines in the frame are scanned out sequentially, one right 
after the other. The term “field” does not apply in a noninterlaced system. 
Another term for a noninterlaced system is progressive scan. 


Never Twice the Same Color, Never The Same Color, or National Television 
Standards Committee, depending on who you're talking to. Technically, NTSC 
is just a color modulation scheme. To fully specify the color video signal, it 
should be referred to as (M) NTSC. “NTSC” is also commonly (though incor- 
rectly) used to refer to any 525/59.94 or 525/60 video system. See also NTSC 
4,43. 
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PAL 60 


PALplus 


PDC 


Pedestal 


This is a NTSC video signal that uses the PAL color subcarrier frequency 
(about 4.43 MHz). It was developed by Sony in the 1970s to more easily adapt 
European receivers to accept NTSC signals. 


Organisation Internationale de Radiodiffision-Television. 
See subtitles. 


See Raw VBI Data. 


When an image is displayed, it is “overscanned” if a small portion of the image 
extends beyond the edges of the screen. Overscan is common in TVs that use 
CRTs to allow for aging and variations in components, temperature and power 


supply. 


PAL stands for Phase Alternation Line, Picture Always Lousy, or Perfect At 
Last depending on your viewpoint. Technically, PAL is just a color modulation 
scheme. To fully specify the color video signal it should be referred to as (B, 
D, G, H, I, M, N, or Nc) PAL. (B, D, G, H, I) PAL is the color video standard 
used in Europe and many other countries. (M, N, Nc) PAL is also used in a 
few places, but is not as popular. “PAL” is also commonly (though incorrectly) 
used to refer to any 625/50 video system. See also PAL 60. 


This is a NTSC video signal that uses the PAL color subcarrier frequency 
(about 4.43 MHz) and PAL-type color modulation. It is a further adaptation of 
NTSC 4.43, modifying the color modulation in addition to changing the color 
subcarrier frequency. It was developed by JVC in the 1980s for use with their 
video disc players, hence the early name of “Disk-PAL.” 

There is a little-used variation, also called PAL 60, which is a PAL video 
signal that uses the NTSC color subcarrier frequency (about 3.58 MHz), and 
PAL-type color modulation. 


PALplus is 16:9 aspect ratio version of PAL, designed to be transmitted using 
normal PAL systems. 16:9 TVs without the PALplus decoder, and standard 4:3 
TVs, show a standard picture. It is defined by BT.1197 and ETSI ETS 300 731. 


See Program Delivery Control. 


Pedestal is an offset used to separate the black level from the blanking level by 
a small amount. When a video system doesn’t use a pedestal, the black and 
blanking levels are the same. (M) NTSC uses a pedestal, (B, D, G, H, I) PAL 
does not. (M) NTSC-J used in Japan also does not use a pedestal. 
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Pseudo Color 


This is a term used to describe a method of adjusting the hue in a NTSC video 
signal. The phase of the color subcarrier is moved, or adjusted, relative to the 
color burst. PAL and SECAM systems do not usually have a phase (or hue) 
adjust control. 


A pixel, which is short for picture element, is the smallest sample that makes 
up a scan line. For example, when the horizontal resolution is defined as 640 
pixels, that means that there are 640 individual locations, or samples, that 
make up the visible portion of each horizontal scan line. Pixels may be square 
or rectangular. 


The pixel clock is used to divide the horizontal line of video into samples. The 
pixel clock has to be stable (a very small amount of jitter) relative to the video 
or the image will not be stored correctly. The higher the frequency of the pixel 
clock, the more samples per line there are. 


This can be a real troublemaker, since it can cause artifacts. In some 
instances, a pixel drop out looks like black spots on the screen, either station- 
ary or moving around. Several things can cause pixel drop out, such as the 
ADC not digitizing the video correctly. Also, the timing between the ADC and 
the frame buffer might not be correct, causing the wrong number to be stored 
in memory. For that matter, the timing anywhere in the video stream might 
cause a pixel drop out. 


A set of colors that can be combined to produce any desired set of intermedi- 
ate colors, within a limitation call the “gamut.” The primary colors for color 
television are red, green, and blue. The exact red, green, and blue colors used 
are dependent on the television standard. 


Information sent during the vertical blanking interval using teletext to control 
VCRs in Europe. The specification is ETSI ETS 300 231. 


See Noninterlaced. 


Pseudo color is a term used to describe a technique that applies color, or 
shows color, where it does not really exist. We are all familiar with the satellite 
photos that show temperature differences across a continent or the multicol- 
ored cloud motion sequences on the nightly weather report. These are real- 
world examples of pseudo color. The color does not really exist. A computer 
uses a lookup table memory to add the color so information, such as tempera- 
ture or cloud height, is viewable. 
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This is basically the same as H.261. The term is starting to fade away since 
H.261 is used in applications other than ISDN video conferencing. 


See Quadrature Amplitude Modulation. 


Quarter Common Interface Format. This video format was developed to allow 
the implementation of cheaper video phones. The QCIF format has a resolu- 
tion of 176 x 144 active pixels and a refresh rate of 29.97 frames per second. 


Quarter Standard Interface Format. The computer industry, which uses 
square pixels, has defined QSIF to be 160 x 120 active pixels, with a refresh 
rate of whatever the computer is capable of supporting. 


Quad chroma refers to a technique where the sample clock is four times the 
frequency of the color burst. For NTSC this means that the sample clock is 
about 14.32 MHz (4x 3.579545 MHz), while for PAL the sample clock is about 
17.73 MHz (4x 4.43361875 MHz). The reason these are popular sample clock 
frequencies is that, depending on the method chosen, they make the chromi- 
nance (color) decoding and encoding easier. 


A method of encoding digital data onto a carrier for RF transmission. QAM is 
typically used for cable transmission of digital SDTV and HDTV signals. DVB- 
C supports 16-QAM, 32-QAM, 64-QAM, 128-QAM, and 256-QAM, although 
receivers need only support up to 64-QAM. 


The modulation of two carrier components, which are 90 degrees apart in 
phase. 


The process of converting a continuous analog signal into a set of discrete lev- 
els (digitizing). 


This is the inherent uncertainty introduced during quantization since only dis- 
crete, rather than continuous, levels are generated. Also called quantization 
distortion. 


Essentially, a raster is the series of scan lines that make up a picture. You may 
from time to time hear the term raster line—it’s the same as scan line. All of 
the scan lines that make up a frame of video form a raster. 


A technique where VBI data (such as teletext and captioning data) is sampled 
by a fast sample clock (i.e. 27 MHz), and output. This technique allows soft- 
ware decoding of the VBI data to be done. 
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Rewritable time code, used in consumer video products. 


Pixels that are not “square pixels” are “rectangular pixels.” 


This is the amount of color subcarrier information present in white, gray, or 
black areas of a composite color video signal (ideally, there is none present). 
The number usually appears as —n dB. The larger “n” is, the better. 


This is the basic measurement of how much information is visible for an 
image. It is usually described as “h” x “v.” The “h” is the horizontal resolution 
(across the display) and the “v” is the vertical resolution (down the display). 
The higher the numbers, the better, since that means there is more detail to 
see. If only one number is specified, it is the horizontal resolution. 

Displays specify the maximum resolution they can handle, determined by 
the display technology and the electronics used. The actual resolution will be 
the resolution of either the source or the display, whichever is lower. 

Vertical resolution is the number of white-to-black and black-to-white tran- 
sitions that can be seen from the top to the bottom of the picture. The maxi- 
mum number is the number of active scan lines used by the image. The actual 
vertical resolution may be less due to processing, interlacing, overscanning, 
or limited by the source. 

Horizontal resolution is the number of white-to-black and black-to-white 
transitions that can be seen from the left to the right of the picture. For digital 
displays, the maximum number is the number of active pixels used by a scan 
line. For both analog and digital displays, the actual horizontal resolution may 
be less due to processing, overscanning, or limited by the source. 


Retrace is what the electron beam does when it gets to the right-hand edge of 
the CRT display to get back to the left-hand edge. Retrace happens during the 
horizontal blanking time. 


Abbreviation for red, green, blue. 


RS-170 is the United States standard that was used for black-and-white TV, and 
defines voltage levels, blanking times, the width of the sync pulses, and so 
forth. The specification spells out everything required for a receiver to display 
a monochrome picture. Now, SMPTE 170M is essentially the same specifica- 
tion, modified for color TV by adding the color components. They modified 
RS-170 just a tiny little bit so that color could be added (RS-170A), with the 
final result being SMPTE 170M for NTSC. This tiny little change was so small 
that the existing black-and-white TVs didn’t even notice it. 
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RS-343 


RSDL 


Run 
Length Coding 


R-Y 


SABC 


Sample 


Sample and 
Hold 


Sample Rate 


RS-343 does the same thing as RS-170, defining a specification for transferring 
analog video, but the difference is that RS-343 is for high-resolution computer 
graphics analog video, while RS-170 is for TV-resolution NTSC analog video. 


RSDL stands for Reverse Spiral Dual Layer. It is a storage method that uses 
two layers of information on one side of a DVD. For movies that are longer 
than can be recorded on one layer, the disc stops spinning, reverses direction, 
and begins playing from the next layer. 


Run length coding is a type of data compression. Let’s say that this page is 
wide enough to hold a line of 80 characters. Now, imagine a line that is almost 
blank except for a few words. It’s 80 characters long, but it’s just about all 
blanks—let’s say 50 blanks between the words “coding” and “medium.” These 
50 blanks could be stored as 50 individual codes, but that would take up 50 
bytes of storage. An alternative would be to define a special code that said a 
string of blanks is coming and the next number is the amount of blanks in the 
string. So, using our example, we would need only 2 bytes to store the string 
of 50 blanks, the first special code byte followed by the number 50. We com- 
pressed the data; 50 bytes down to 2. This is acompression ration of 25:1. Not 
bad, except that we only compressed one line out of the entire document, so 
we should expect that the total compression ratio would be much less. 

Run length coding all by itself as applied to images is not as efficient as 
using a DCT for compression, since long runs of the same “number” rarely 
exist in real-world images. The only advantage of run length coding over the 
DCT is that it is easier to implement. Even though run length coding by itself 
is not efficient for compressing images, it is still used as part of the JPEG, 
MPEG, H.261, and H.263 compression schemes. 


In video, the red-minus-luma signal, also called a color difference signal. 
When added to the luma (Y) signal, it produces the red video signal. 


South Africa Broadcasting Corporation. 


To obtain values of a signal at periodic intervals. Also the value of a signal at a 
given moment in time. 


A circuit that samples a signal and holds the value until the next sample is 
taken. 


Sample rate is how often a sample of a signal is taken. The sample rate is 
determined by the sample clock. 
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SAP 


Saturation 


Scaling 


Scan Line 


Scan 
Velocity 
Modulation 


SCART 


SDI 


SDTI 


See Secondary Audio Program. 


Saturation is the amount of color present. For example, a lightly saturated red 
looks pink, while a fully saturated red looks like the color of a red crayon. Sat- 
uration does not mean the brightness of the color, just how much “pigment” is 
used to make the color. The less “pigment,” the less saturated the color is, 
effectively adding white to the pure color. 


Scaling is the act of changing the resolution of an image. For example, scaling 
a 640 x 480 image by one-half results in a 320 x 240 image. Scaling by 2x 
results in an image that is 1280 x 960. There are many different methods for 
image scaling, and some “look” better than others. In general, though, the bet- 
ter the algorithm “looks,” the more expensive it is to implement. 


A scan line is an individual line across the display. It takes 525 of these scan 
lines to make up a NTSC TV picture and 625 scan lines to make up a PAL TV 
picture. 


See Velocity Scan Modulation. 


Syndicat des Constructeurs d'Appareils Radio Recepteurs et Televiseurs. This 
is a 21-pin connector supported by many consumer video components in 
Europe. It allows mono or stereo audio and composite, s-video, or RGB video 
to be transmitted between equipment. 

The IEC 60933-1 and 60933-2 standards specify the basic SCART connec- 
tor, including signal levels. 


Serial Digital I/O. Another name for the 270 Mbps or 360 Mbps serial inter- 
face defined by BT.656. It is used primarily on professional and studio video 
equipment. 


Serial Data Transport Interface, defined by SMPTE 305M. 
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SDTV Short for Standard Definition Television. SDTV is a television that displays 


(Standard 
Definition TV) 
rates are: 


less active vertical resolution than EDTV. No aspect ratio is specified. 
For the ATSC system, typical SDTV (luminance) resolutions and refresh 


Refresh Rate 
Active Aspect (p = progressive, i = interlaced) 
Resolution Ratio 59.941 
60i 
720 x 360 16:9 x 
640 x 480 4:3 x 
720 x 480 4:3 x 


For the DVB system, typical SDTV (luminance) resolutions and refresh 


rates are: 
Refresh Rate 
Active Aspect (p = progressive, i = interlaced) 
Resolution, +) REO, | 33:976p asp | 22277 | soi | sop | 59-241 | 59-94p 
24p 30p 60i 60p 
352 x 288 4:3 x x x x 
720 x 432 16:9 x 
352 x 576 4:3 x 
480 x 576 4:3 x 
544 x 576 4:3 x 
720 x 576 4:3 x 
352 x 240 4:3 x x x x 
720 x 360 16:9 x 
352 x 480 4:3 x 
480 x 480 4:3 x 
544 x 480 4:3 x 
640 x 480 4:3 x 
720 x 480 4:3 x 


722 Chapter 16: Glossary 


SECAM 


Secondary 
Audio Program 


Serration 
Pulses 


Setup 


SIF 


Signal-to-Noise 
Ratio (SNR) 


Silent Radio 


Sliced VBI 
Data 


This is another color video format similar to PAL. The major differences 
between the two are that in SECAM the chroma is FM modulated and the R-Y 
and B-Y signals are transmitted line sequentially. SECAM stands for Sequen- 
tiel Couleur Avec Memoire or Sequential Color with Memory. 


Generally used to transmit audio in a second language. May also be used to 
transmit the [aural] description of key visual elements of a program, inserted 
into the natural pauses in the audio of the programming. 


These are pulses that occur during the vertical sync interval of NTSC, PAL, 
and SECAM, at twice the normal horizontal scan rate. The reason these exist 
was to ensure correct 2:1 interlacing in early televisions and eliminate DC off- 
set buildup. 


Setup is the same thing as Pedestal. 


Standard (or Source) Input Format. This video format was developed to allow 
the storage and transmission of digital video. The 625/50 SIF format has a res- 
olution of 352 x 288 active pixels and a refresh rate of 25 frames per second. 
The 525/60 SIF format has a resolution of 352 x 240 active pixels and a refresh 
rate of 30 frames per second. Note that MPEG 1 allows resolutions up to 4095 
x 4095 active pixels; however, there is a “constrained subset” of parameters 
defined as SIF. The computer industry, which uses square pixels, has defined 
SIF to be 320 x 240 active pixels, with a refresh rate of whatever the computer 
is capable of supporting. 


Signal-to-noise ratio is the magnitude of the signal divided by the amount of 
unwanted stuff that is interfering with the signal (the noise). SNR is usually 
described in decibels, or “dB,” for short; the bigger the number, the better 
looking the picture. 


Silent Radio is a service that feeds data that is often seen in hotels and night- 
clubs. It’s usually a large red sign that shows current news, events, scores, etc. 
It is present on NTSC lines 10-11 and 273-274, and uses encoding similar to 
EIA-608. 


A technique where a VBI decoder samples the VBI data (such as teletext and 
captioning data), locks to the timing information, and converts it to binary 0’s 
and 1’s. DC offsets, amplitude variations, and ghosting must be compensated 
for by the VBI decoder to accurately recover the data. 
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SMPTE 12M 


SMPTE 125M 


SMPTE 170M 


SMPTE 240M 


SMPTE 244M 


SMPTE 253M 


SMPTE 259M 


SMPTE 260M 


SMPTE 266M 


SMPTE 267M 


SMPTE 272M 


SMPTE 274M 


SMPTE 276M 


SMPTE 291M 


Defines the longitudinal (LTC) and vertical interval (VITC) timecode for 
NTSC and PAL video systems. LTC requires an entire field time to store time- 
code information, using a separate track. VITC uses one scan line each field 
during the vertical blanking interval. 


720 x 480 pro-video interlaced standard (29.97 Hz). Covers the digital repre- 
sentation and the digital parallel interface. Also see BT.601 and BT.656. 


NTSC video specification for the United States. See RS-170A and BT.470. 

1920 x 1035 pro-video interlaced standard (29.97 or 30 Hz). Covers the analog 
RGB and YPbPr representation. The digital parallel interface is defined by 
SMPTE 260M. The digital serial interface is defined by SMPTE 292M. 

768 x 486 pro-video interlaced standard (29.97 Hz). Covers the digital repre- 
sentation (composite NTSC video sampled at 4x Fsc) and the digital parallel 
interface. The digital serial interface is defined by SMPTE 259M. 

Analog RGB video interface specification for pro-video SDTV systems. 
Pro-video serial digital interface for SMPTE 244M. 

Digital representation and parallel interface for SMPTE 240M video. 

Defines the digital vertical interval timecode (DVITC). Also see BT.1366. 


960 x 480 pro-video interlaced standard (29.97 Hz). Covers the digital repre- 
sentation and the digital parallel interface. Also see BT.601 and BT.1302. 


Formatting AES/EBU digital audio and auxiliary data into the digital blanking 
intervals. Also see BT.1305. 


1920 x 1080 pro-video interlaced and progressive standards (29.97, 30, 59.94 
and 60 Hz). Covers the digital representation, the analog RGB and YPbPr 
interfaces, and the digital parallel interface. The digital serial interface is 
defined by SMPTE 292M. 


Transmission of AES/EBU digital audio and auxiliary data over coaxial cable. 


Ancillary data packet and space formatting for pro-video digital interfaces. 
Also see BT.1364. 
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SMPTE 292M 


SMPTE 293M 


SMPTE 294M 


SMPTE 296M 


SMPTE 299M 


SMPTE 305M 


SMPTE 308M 


SMPTE 314M 


SMPTE 322M 


SMTPE 344M 


SMPTE 348M 


1.485 Gbps pro-video HDTV serial interfaces. 


720 x 480 pro-video progressive standards (59.94 Hz). Covers the digital repre- 
sentation, the analog RGB and YPbPr interfaces, and the digital parallel inter- 
face. The digital serial interface is defined by SMPTE 294M. Also see BT.1358 
and BT.1362. 


Pro-video serial digital interface for SMPTE 293M. 


1280 x 720 pro-video progressive standards. Covers the digital representation 
and the analog RGB and YPbPr interfaces. The digital parallel interface uses 
SMPTE 274M. The digital serial interface is defined by SMPTE 292M. 


24-bit digital audio format for pro-video HDTV serial interfaces. Also see 
BT.1365. 


Serial data transport interface (SDTI). This is a 270 or 360 Mbps serial inter- 
face based on BT.656 that can be used to transfer almost any type of digital 
data, including MPEG 2 program and transport bitstreams, DV bitstreams, 
etc. You cannot exchange material between devices that use different data 
types. Material that is created in one data type can only be transported to 
other devices that support the same data type. There are separate map docu- 
ments that format each data type into the 305M transport. 


MPEG 2 4:2:2 profile at high level. 


Defines the data structure for DV audio, data and compressed video at 25 and 
50 Mbps. Also see IEC 61834. 


Data stream format for the exchange of DV audio, data and compressed video 
over the Serial Data Transport Interface (SDTI or SMPTE 305M). 


Defines a 540 Mbps serial digital interface for pro-video applications. 


High data-rate serial data transport interface (HD-SDTI). This is a 1.485 Gbps 
serial interface based on SMPTE 292M that can be used to transfer almost any 
type of digital data, including MPEG 2 program and transport bitstreams, DV 
bitstreams, etc. You cannot exchange material between devices that use differ- 
ent data types. Material that is created in one data type can only be trans- 
ported to other devices that support the same data type. There are separate 
map documents that format each data type into the 348M transport. 
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SMPTE RP160 


SPDIF 


Split Sync 
Scrambling 


Square 
Pixels 


Analog RGB and YPbPr video interface specification for pro-video HDTV sys- 
tems. 


Short for Sony/Philips Digital InterFace. This is a consumer interface used to 
transfer digital audio. A serial, self-clocking scheme is used, based on a coax 
or fiber interconnect. The audio samples may be 16-24 bits each. 16 different 
sampling rates are supported, with 32, 44.1, and 48 kHz being the most com- 
mon. IEC 60958 now fully defines this interface for consumer and professional 
applications. 


Split sync is a video scrambling technique, usually used with either horizontal 
blanking inversion, active video inversion, or both. In split sync, the horizontal 
sync pulse is “split,” with the second half of the pulse at +100 IRE instead of 
the standard -40 IRE. Depending on the scrambling mode, either the entire 
horizontal blanking interval is inverted about the +30 IRE axis, the active video 
is inverted about the +30 IRE axis, both are inverted, or neither is inverted. By 
splitting the horizontal sync pulse, a reference of both —40 IRE and +100 IRE is 
available to the descrambler. 

Since a portion of the horizontal sync is still at -40 IRE, some sync separa- 
tors may still lock on the shortened horizontal sync pulses. However, the tim- 
ing circuits that look for color burst a fixed interval after the end of horizontal 
sync may be confused. In addition, if the active video is inverted, some video 
information may fall below 0 IRE, possibly confusing sync detector circuits. 

The burst is always present at the correct frequency and timing; however, 
the phase is shifted 180 degrees when the horizontal blanking interval is 
inverted. 


A “square pixel” is one that has the same number of active samples both hori- 
zontally and vertically, for a 1:1 aspect ratio. Computers and HDTV use square 
pixels. 

Using 480 active scan lines for NTSC, if the display had a 1:1 aspect ratio, 
square pixels would mean there would be 480 active samples per line. Since 
the display has a 4:3 aspect ratio, the number of active samples is (480) * (4/3) 
or 640. To get 640 active samples per line, you need a 12.27 MHz sample 
clock. 

Using 576 active scan lines for PAL, if the display had a 1:1 aspect ratio, 
square pixels would mean there would be 576 active samples per line. Since 
the display has a 4:3 aspect ratio, the number of active samples is (576) * (4/3) 
or 768. To get 768 active samples per line, you need a 14.75 MHz sample 
clock. 
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Standard 
Definition 
Television 


Subcarrier 


Subsampled 


Subtitles 


Superblack 


Super 
VideoCD, 
Super VCD 


S-VHS 


See SDTV. 


A secondary signal containing additional information that is added to a main 
signal. 


Subsampled means that a signal has been sampled at a lower rate than some 
other signal in the system. A prime example of this is the 4:2:2 YCbCr color 
space used in ITU-R BT.601. For every two luma (Y) samples, only one Cb and 
Cr sample is present. This means that the Cb and Cr signals are subsampled. 


Text that is added below or over a picture that usually reflects what is being 
said, possibly in another language. Open subtitles are transmitted as video 
that already has the subtitles present. Closed subtitles are transmitted during 
the VBI, and relies on the TV to decode it and position it below or over the pic- 
ture. Closed captioning is a form of subtitling. Subtitling for DVB is specified 
in ETSI ETS 300 743. 


A keying signal that is embedded within the composite video signal as a level 
between black and sync. It is usually used to improve luma self-keying 
because the video signal contains black, making a good luma self-key hard to 
implement. When a downstream keyer detects the super black level, it inserts 
the second composite video signal. 


Next generation VideoCD, defined by the China National Technical Commit- 
tee of Standards on Recording, that hold 35-70 minutes of digital audio and 
video information. MPEG 2 video is used, with a resolution of 480 x 480 (29.97 
Hz frame rate) or 480 x 576 (25 Hz frame rate). Audio uses MPEG layer 2 ata 
bit rate of 32-384 Mbps, and supports four mono, two stereo, or 5.1 channels. 
Subtitles use overlays rather than subpictures (DVD-Video) or being encoded 
as video (VideoCD). Variable bit-rate encoding is used, with a maximum data 
bit rate of 2.6 Mbps. IEC 62107 defines the Super VideoCD standard. 


S-VHS is an enhancement to regular VHS video tape decks. S-VHS provides 
better resolution and less noise than VHS. S-VHS video tape decks support s- 
video inputs and outputs, although this is not required. It does, however, 
improve the quality by not having to separate and then merge the luma and 
chroma signals. 
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S-Video 


SVM 


Sync 


Sync 
Generator 


Sync Noise 
Gate 


Sync Stripper 


Synchronous 


SsvCcD 
TDF 


Teletext 


Separate video, also called Y/C video. Separate luma (Y) and chroma (C) 
video signals are used, rather than a single composite video signal. By simply 
adding together the Y and C signals, you generate a composite video signal. 

A DC offset of +2.3v may be present on the C signal when a letterbox pic- 
ture format is present. A DC offset of +5v may be present to indicate when a 
16:9 anamorphic picture format is present. A standard 4:3 receiver ignores all 
DC offsets, thus displaying a typical letterboxed picture. 

The IEC 60933-5 standard specifies the s-video connector, including signal 
levels. 


See Velocity Scan Modulation. 


Sync is a fundamental, you gotta have it, piece of information for displaying 
any type of video. This goes for displays or PCs. Essentially, the sync signal 
tells the display where to put the picture. The horizontal sync, or HSYNC for 
short, tells the display where to put the picture in the left-to-right dimension. 
The vertical sync, or VSYNC for short, tells the display where to put the pic- 
ture from top-to-bottom. 


A sync generator is a circuit that provides sync signals. A sync generator may 
have genlock capability. 


A sync noise gate is used to define an area within the video waveform where 
the video decoder is to look for the sync pulse. Anything outside of this 
defined window will be rejected. The main purpose of the sync noise gate is to 
make sure that the output of the video decoder is nice, clean, and correct. 


A video signal contains video information, which is the picture to be displayed, 
and timing (sync) information that tells the receiver where to put this video 
information on the display. A sync stripper pulls out the sync information from 
the video signal and throws the rest away. 


Refers to two or more events that happen in a system or circuit at the same 
time. 


See Super VideoCD. 
Telediffusion de France. 


A method of transmitting data with a video signal. ITU-R BT.653 lists the 
major teletext systems used around the world, while ETSI ETS 300 706 
defines in detail the teletext standard for PAL. North American Broadcast 
Teletext Specification (NABTS) is 525-line system C. 
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Tessellated 
Sync 


Timebase 
Corrector 


True Color 


Underscan 


Uplink 
V-chip 
VBI 


Velocity Scan 
Modulation 


Vertical 
Blanking 
Interval (VBI) 


Vertical 
Interval 
Timecode 


For digital transmissions such as HDTV and SDTV, the teletext characters 
are multiplexed as a separate stream along with the video and audio data. It is 
common practice to actually embed this stream in the MPEG video bitstream 
itself, rather than at the transport layer. Unfortunately there is no wide-spread 
standard for this teletext stream—each system (DSS, DVB, ATSC, DVD) has 
its own solution. 


This is what the Europeans call serrated sync. See Serration Pulses and Sync. 


Certain video sources have their sync signals screwed up. The most common 
of these sources is the VCR. A timebase corrector “fixes” a video signal that 
has bad sync timing. 


True color means that each sample of an image is individually represented 
using three color components, such as RGB or YCbCr. 


When an image is displayed, it is “underscanned” if all of the image, including 
the top, bottom, and side edges, are visible on the display. Underscan is com- 
mon in computer displays. 


The carrier used by Earth stations to transmit information to a satellite. 
See EJA-744. 
See Vertical Blanking Interval. 


Commonly used in TVs to increase the apparent sharpness of a picture. At 
horizontal dark-to-light transitions, the beam scanning speed is momentarily 
increased approaching the transition, making the display relatively darker just 
before the transition. Upon passing into the lighter area, the beam speed is 
momentarily decreased, making the display relatively brighter just after the 
transition. The reverse occurs in passing from light to dark. 


During the vertical blanking interval, the video signal is at the blanking level 
so as not to display the electron beam when it sweeps back from the bottom to 
the top side of the CRT screen. 


Timecode information is stored on a scan line during each vertical blanking 
interval. 
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Vertical 
Resolution 


Vertical Scan 
Rate 


Vertical Sync 


Vestigial 
Sideband 


Video Carrier 


Video CD 


Video 
Interface Port 


Video Mixing 
Video 
Modulation 


Video Module 
Interface (VMI) 


Video Program 
System (VPS) 


VIP 


See Resolution. 


For noninterlaced video, this is the same as the frame rate. For interlaced 
video, it is usually one-half the field rate. 


This is the portion of the video signal that tells the video decoder where the 
top of the picture is. 


A method of encoding digital data onto a carrier for RF transmission. 8-VSB is 
used for over-the-air broadcasting of ATSC HDTV in the United States. 


A specific frequency that is modulated with video data before being mixed 
with the audio data and transmitted. 


Compact discs that hold up to about an hour of digital audio and video infor- 
mation. MPEG 1 video is used, with a resolution of 352 x 240 (29.97 Hz frame 
rate) or 352 x 288 (25 Hz frame rate). Audio uses MPEG layer 2 at a fixed bit 
rate of 224 kbps, and supports two mono or one stereo channels (with optional 
Dolby pro-logic). Fixed bit-rate encoding is used, with a bit rate of 1.15 Mbps. 
The next generation, defined for the Chinese market, is Super VideoCD. 


A digital video interface designed to simplify interfacing video ICs together. 
One portion is a digital video interface (based on BT.656) designed to simplify 
interfacing video ICs together. A second portion is a host processor interface. 
VIP is a VESA specification. 


Video mixing is taking two independent video sources (they must be gen- 
locked) and merging them together. See Alpha Mix. 


Converting a baseband video signal to an RF signal. 

A digital video interface designed to simplify interfacing video ICs together. It 
is being replaced by VIP. 

VPS is used in some countries instead of PDC to control VCRs. The data for- 
mat is the same as for PDC, except that it is transmitted on a dedicated line 


during the vertical blanking interval, usually line 16. 


See Video Interface Port. 
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VITC 

VMI 

VPS 

VSB 

VSM 

VSYNC 
White Level 
Wide Screen 


Signalling 
(WSS) 


World System 
Teletext (WST) 


WSS 
WST 
Y/C Video 


Y/C Separator 


YCbCr 


See Vertical Interval Timecode. 

See Video Module Interface. 

See Video Program System. 

See Vestigial Sideband. 

See Velocity Scan Modulation. 

Check out the Vertical Sync definition. 

This level defines what white is for the particular video system. 


WSS may be used on (B, D, G, H, I) PAL line 23 and (M) NTSC lines 20 and 
283 to specify the aspect ratio of the program and other information. 16:9 TVs 
may use this information to allow displaying of the program in the correct 
aspect ratio. ITU-R BT.1119 and ETSI ETS 300 294 specify the WSS signal for 
PAL and NTSC systems. EJA-J CPR-1204 and IEC 61880 also specify another 
WSS signal for NTSC systems. 


BT.653 525-line and 625-line system B teletext. 


See Wide Screen Signaling. 
See World System Teletext. 
See S-video. 


A Y/C separator is what’s used in a video decoder to separate the luma and 
chroma in a NTSC or PAL system. This is the first thing that any video 
decoder must do. The composite video signal is fed to a Y/C separator so that 
the chroma can then be decoded further. 


YCbCr is the color space originally defined by BT.601, and now used for all 
digital component video formats. Y is the luma component and the Cb and Cr 
components are color difference signals. The technically correct notation is 
Y'Cb'Cr' since all three components are derived from R'G'B'. Many people use 
the YCbCr notation rather than Y'CbCr or Y'Cb'Cr'. 

4:4:4 Y'CbCr means that for every Y sample, there is one sample each of 
Ch and Cr. 
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YIQ 


YPbPr 


YUV 


YUV9 


4:2:2 Y'CbCr means that for every two horizontal Y samples, there is one 
sample each of Cb and Cr. 

4:1:1 Y'CbCr means that for every four horizontal Y samples, there is one 
sample each of Cb and Cr. 

4:2:0 Y'CbCr means that for every block of 2 x 2 Y samples, there is one 
sample each of Cb and Cr. There are three variations of 4:2:0 YCbCr, with the 
difference being the position of Cb and Cr sampling relative to Y. 

Note that the coefficients to convert R'G'B' to YCbCr are different for 
SDTV and HDTV applications. 


YIQ is a color space optionally used by the NTSC video system. The Y compo- 
nent is the black-and-white portion of the image. The I and Q parts are the 
color difference components; these are effectively nothing more than color 
placed over the black and white, or luma, component. Many people use the 
YIQ notation rather than Y'IQ or Y'I'Q'. The technically correct notation is 
Y'T'Q' since all three components are derived from R'G'B'. 


YPbPr is a scaled version of the YUV color space, with specific levels and tim- 
ing signals, designed to interface equipment together. For HDTV interfaces, 
0.3v tri-level sync signals are used, while for SDTV interfaces, a 0.3v bi-level 
sync signal is present. Most use the YPbPr notation rather than Y'PbPr or 
Y'Pb'Pr'. The technically correct notation is Y'Pb'Pr' since all three compo- 
nents are derived from R'G'B'. 


YUV is the color space used by the NTSC and PAL video systems. As with the 
YIQ color space, the Y is the luma component while the U and V are the color 
difference components. Many people use the YUV notation when they actually 
mean YCbCr data. Most use the YUV notation rather than Y'UV or Y'U'V'. The 
technically correct notation is Y'U'V' since all three components are derived 
from R'G'B'. 

YUV is also the name for some component analog interfaces on consumer 
equipment. Some manufacturers incorrectly label it YCbCr. THX certification 
requires it to be labeled YPbPr. 


Intel’s 4:1:0 YCbCr format. The picture is divided into blocks, with each block 
comprising 4 x 4 samples. For each block, sixteen 8-bit values of Y, one 8-bit 
value of Cb, and one 8-bit value of Cr are assigned. The result is an average of 
9 bits per pixel. 
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YUV12 


YUY2 
Zipper 


Zoom 


Zoomed Video 
Port 


ZV Port 


Zweiton 


Intel’s notation for MPEG 1 4:2:0 YCbCr stored in memory in a planar format. 
The picture is divided into blocks, with each block comprising 2 x 2 samples. 
For each block, four 8-bit values of Y, one 8-bit value of Cb, and one 8-bit value 
of Cr are assigned. The result is an average of 12 bits per pixel. 


Intel’s notation for 4:2:2 YCbCr format. 
See the definition for Creepy-Crawlies. 


Zoom is a type of image scaling. Zooming is making the picture larger so that 
you can see more detail. The examples described in the definition of scaling 
are also examples that could be used here. 


Used on laptops, the ZV Port is a point-to-point uni-directional bus between the 
PC Card host adapter and the graphics controller, enabling video data to be 
transferred real-time directly from the PC Card into the graphics frame buffer. 

The PC Card host adapter has a special multimedia mode configuration. If 
a non-ZV PC Card is plugged into the slot, the host adapter is not switched 
into the multimedia mode, and the PC Card behaves as expected. Once a ZV 
card has been plugged in and the host adapter has been switched to the multi- 
media mode, the pin assignments change. 


See Zoomed Video Port. 


A technique of implementing stereo or dual-mono audio for NISC and PAL 
video. One FM subcarrier transmits a L+R signal, and a second FM subcarrier 
transmits a R signal (for stereo) or a second L+R signal. It is discussed in 
BS.707, and is similar to the BTSC technique. 
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10-step staircase test signal 309 
10T pulse 320 
12.5T pulse 320 
1394 
see IEEE 1394 
20T pulse 320 
25T pulse 320 
2D comb filter 433 
2T pulse 320 
3-2 pulldown 227 
3D comb filter 440 
4-1-1 YCbCr 20 
4-2-0 YCbCr 20 
4-2-2 profile 558 
4-2-2 YCbCr 19 
4-4-4 to 4-2-2 conversion 189 
4-4-4 YCbCr 19 
8-VSB 652 
helper signals 654 


A 

A/52 644 
A/53 644 
A/54 644 
A/57 644 
A/58 644 
A/64 644 
A/65 644 


A/70 644 
A/80 644 
A/90 644 
AC-3 645, 655 
adaptive comb filter 439 
advanced television enhancement forum 365 
alpha 196, 404, 440 
alpha channel 404, 440 
alpha mixing 196 
analog component video 66 
1125-line interlaced 57 
1125-line progressive 57 
525-line interlaced 37 
525-line progressive 37 
625-line interlaced 46 
625-line progressive 46 
750-line progressive 54 
YPbPr 77, 82 
anamorphic 346 
ancillary data 94 
BT.1304 104 
BT.1305 99 
BT.1364 94 
BT.1366 99 
data block number 96 
data count 96 
data ID 96 
digital audio 99 
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EIA-608 closed captioning 104 
EIA-708 closed captioning 104 
error detection checksum 104 
ITU-R BT.1304 104 
ITU-R BT.1305 99 
ITU-R BT.1364 94, 134 
ITU-R BT.1366 99 
secondary ID 96 
SMPTE 266M 104 
SMPTE 272M 99 
SMPTE 291M 94, 134 
SMPTE 334M 104 
timecode 99 
user data words 96 
video index 105 
announcement support descriptor 659 
anti-aliased resampling 216 
ATSC 644 
8-VSB 652 
pilot 654 
A/49 367 
A/52 644 
A/53 644 
A/54 644 
A/57 644 
A/58 644 
A/64 644 
A/65 644 
A/70 644 
A/80 644 
A/90 644 
AC-3 645 
audio 
commentary 647 
dialogue 647 
emergency 647 
hearing impaired 647 
main audio service 646 
music and effects 646 
visually impaired 646 
voice-over 647 
audio capability 645 
audio compression 645 


caption service descriptor 650 
closed captioning 647 
component name descriptor 651 
content advisory descriptor 650 
descriptor 650 
caption service descriptor 650 
component name descriptor 651 
content advisory descriptor 650 
extended channel name descriptor 650 
stuffing descriptor 651 
time-shifted service descriptor 651 
directed channel change selection code ta- 
ble 650 
directed channel change table 650 
Dolby Digital 645 
emergency messages 647 
event information tables 650 
extended channel name descriptor 650 
extended text tables 650 
field sync 654 
helper signals 654 
field syne 654 
pilot 654 
segment sync 654 
master guide table 649 
MPEG 2 645 
pilot 654 
program and system information protocol 
648 
PSIP 648 
rating region table 650 
stuffing descriptor 651 
system time table 649 
table 
directed channel change selection code 
table 650 
directed channel change table 650 
event information tables 650 
extended text tables 650 
master guide table 649 
rating region table 650 
system time table 649 
virtual channel table 650 
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time-shifted service descriptor 651 
transmission format 652 
transport stream 645 
video compression 645 
video resolution 645 
virtual channel table 650 
VSB 
pilot 654 
ATSC standard 644 
ATVEF 365 
audio 
AC-3 645, 655 
ancillary data 99 
ATSC 645 
BTSC 248 
Dolby Digital 645, 655 
DTV 645, 655 
DV 497 
DVB 655 
EIA-J 250 
HDTV 645, 655 
MPEG 1521, 655 
ASPEC 522 
background theory 522 
MUSICAM 521 
sound quality 521 
MPEG 2 558, 655 
NICAM 728 274 
NTSC 248 
PAL 271 
SDTV 645, 655 
zweiton 271 
automatic gain control 406 


B 

B frame 463, 524 

B pictures 565 

backward prediction 526, 567 
bandwidth-limited edge generation 398 
Betacam interface 87 

bidirectional frame 463, 524 
bidirectional pictures 565 

bidirectional prediction 526, 567 


bilinear interpolation 216 
black burst 388 
block 234 
MPEG 1 526 
MPEG 2 567 
bob and weave 230 
bouquet association table 659 
bouquet name descriptor 659 
Bresenham algorithm 216 
brightness adjustment 192 
Bruch blanking 396 
BS.707 271 
BST-COFDM 645 
BT.1119 284, 352 
BT.1124 367, 368 
BT.1197 284 
BT.1303 116 
BT.1304 104 
BT.1305 99 
BT.1358 43, 51 
BT.1362 119 
BT.1364 94, 134 
BT.1366 99 
BT.1381 140 
BT.470 447 
BT.471 297 
BT.473 317 
BT.601 35, 39 
BT.653 357 
BT.656 108 
BT.656 IC video interface 154 
BT.709 57, 62 
BT.799 116 
BT.809 359 
BTSC 248 
burst generation 384 


C 

CA identifier descriptor 659 
caption service descriptor 650 
cell frequency link descriptor 659 
cell list descriptor 659 

CGMS-A 346, 352, 357 
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chroma keying 206 

chroma spill 207 

garbage matte 209 

linear keying 206 

luminance modulation 207 

shadow chroma keying 207 

wide keying 214 
chroma spill 207 
chromaticity coordinates 29 
chromaticity diagram 26 
chrominance (analog) 384 
chrominance bars 

EBU 385 

EIA 385 
chrominance demodulation 407 
chrominance demodulator 407 
chrominance nonlinear gain 401, 446 
chrominance nonlinear phase 401, 446 
chrominance-luminance intermodulation 401, 
446 
clean encoding systems 397 
closed captioning 332, 350, 656 

ancillary data 104 

Europe 350 
closed GOP 525, 565 
COFDM 660 

helper signals 663 

pilot 663 
color bars 296 

HSI 26 

HSV 26 

NTSC 300 

PAL 303 

RGB 16 

YCbCr 18 
color burst detection 415 
color saturation accuracy 402, 446 
color space 

HLS 25 

HSI 25 

HSV 25 

PhotoYCC 24 

RGB 15 


YCbCr 17 
4-4-4 to 4-2-2 conversion 189 
YIQ 17 
YUV 16 
YUV12 20 
color space equations 
constant luminance principle 31 
conversion problems 30 
general 
RGB to YCbCr 18 
RGB to YIQ 17 
RGB to YUV 16 
YCbCr to RGB 18 
YIQ to RGB 17 
YUV to RGB 16 
HDTV to SDTV YCbCr 188 
overflow handling 31 
PhotoYCC to RGB 25 
RGB to PhotoYCC 25 
SDTV to HDTV YCbCr 188 
color transient improvement 194 
comb filter 433 
comb filtering 433 
combination test signal 317 
complementary filtering 429 
component descriptor 659 
component video 
analog 
see analog component video 
digital 
see digital component video 
composite chroma keying 214 
composite test signal 314 
constrained parameters bitstream 522, 558 
content advisory descriptor 650 
content descriptor 659 
contrast adjustment 192 
copy generation management system 346, 352, 
357 
coring 194 
country availability descriptor 660 
CPB 522, 558 
cross-color 428 


Index 


crossfading 233 
cross-luminance 428 


D 

D frame 530 

data broadcasting 658 
data carousels 658 
data piping 658 
data streaming 658 
DVB 658 
multiprotocol encapsulation 658 
object carousels 658 

data carousels 658 

data partioning 564 

data streaming 658 

datacasting 658 

DCT 234 
DV 502 
H.261 453 
MPEG 1526 
MPEG 2 567 

deinterlacing 230 
bob 230 
field merging 232 
fractional ratio interpolation 230 
inverse telecine 233 
motion adaptive 232 
motion compensated 233 
scan line duplication 230 
scan line interpolation 230 
variable interpolation 230 
weave 232 

descriptor 632, 650, 659 
announcement support descriptor 659 
audio stream 636 
bouquet name descriptor 659 
CA identifier descriptor 659 
caption service descriptor 650 
cell frequency link descriptor 659 
cell list descriptor 659 
component descriptor 659 
component name descriptor 651 


content advisory descriptor 650 
content descriptor 659 

copyright 632 

country availability descriptor 660 
data stream alignment 632 
extended channel name descriptor 650 
frequency list descriptor 660 
hierarchy 637 

language 634 

linkage descriptor 660 

local time offset descriptor 660 
maximum bitrate 638 


multilingual bouquet name descriptor 660 


multilingual component descriptor 660 


multilingual network name descriptor 660 


multilingual service descriptor 660 
multiplex buffer utilization 634 
network name descriptor 660 
NVOD reference descriptor 660 
parental rating descriptor 660 
private data 635 
registration 633 
service descriptor 660 
service list descriptor 660 
service move descriptor 660 
short event descriptor 660 
stuffing descriptor 651 
subtitling descriptor 660 
system clock 634 
target background grid 633 
teletext descriptor 660 
time shifted service descriptor 660 
time-shifted service descriptor 651 
VBI descriptor 660 
VBI teletext descriptor 660 
video stream 635 
video window 637 
DFP 164 
differential gain 401, 443 
differential luminance 401, 443 
differential phase 399, 443 
digital camera specification 183 
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digital component video 35 
1080-line interlaced 57 
1080-line progressive 62 
1125-line interlaced 57 
1125-line progressive 62 
4-4-4 to 4-2-2 YCbCr 189 
480-line interlaced 39 
480-line progressive 43 
525-line interlaced 39 
525-line progressive 43 
576-line interlaced 46 
576-line progressive 51 
625-line interlaced 46 
625-line progressive 51 
720-line progressive 54 
750-line progressive 54 
ancillary data 94 
coding ranges 35 
EAV timing 94 
F control signal 37 
F timing 94 
filtering 

CbhCr 192 

Y 189 
H control signal 37 
H timing 94 
SAV timing 94 
V control signal 37 
V timing 94 
YCbCr 17 

digital composite video 127 
25 Hz offset compensation 139 
ancillary data 134 
SCH phase 133 
TRS-ID 139 
video levels (NTSC) 127 
zero SCH phase 133 

digital flatpanel interface 164 

digital television 644 
see DTV 

digital transmission content protection 174 

digital visual interface 160 


directed channel change selection code table 


650 
directed channel change table 650 
discrete cosine transform 234 
DV 502 
H.261 453 
MPEG 1 526, 528 
MPEG 2 567, 570 
display enhancement processing 
brightness 192 
color transient improvement 194 
contrast 192 
hue 192 
saturation 192 
sharpness 194 
Dolby Digital 645, 655 
double scan 221 
drop frame 322 
DTCP 174 
DTV 644 
ATSC 644 
8-VSB 652 
A/52 644 
A/54 644 
A/57 644 
A/58 644 
A/64 644 
A/65 644 
A/70 644 
A/80 644 
A/90 644 
A53 644 
AC-3 645 
audio 
commentary 647 
dialogue 647 
emergency 647 
hearing impaired 647 
main audio service 646 
music and effects 646 
visually impaired 646 
voice-over 647 
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audio capability 645 
audio compression 645 
caption service descriptor 650 
closed captioning 647 
component name descriptor 651 
content advisory descriptor 650 
descriptor 650 
caption service descriptor 650 
component name descriptor 651 
content advisory descriptor 650 
extended channel name descriptor 
650 
stuffing descriptor 651 
time-shifted service descriptor 651 
directed channel change selection code 
table 650 
directed channel change table 650 
Dolby Digital 645 
emergency messages 647 
event information tables 650 
extended channel name descriptor 650 
extended text tables 650 
field sync 654 
helper signals 654 
field sync 654 
pilot 654 
segment sync 654 
master guide table 649 
pilot 654 
program and system information proto- 
col 648 
PSIP 648 
rating region table 650 
stuffing descriptor 651 
system time table 649 
table 
directed channel change selection 
code table 650 
directed channel change table 650 
event information tables 650 
extended text tables 650 
master guide table 649 


rating region table 650 

system time table 649 

virtual channel table 650 
time-shifted service descriptor 651 
transmission format 652 
video compression 645 
video resolution 645 
virtual channel table 650 
VSB 652 


DVB 654 


announcement support descriptor 659 
audio compression 655 
bouquet association table 659 
bouquet name descriptor 659 
CA identifier descriptor 659 
cell frequency link descriptor 659 
cell list descriptor 659 
closed captioning 656 
COFDM 660 
helper signals 663 
component descriptor 659 
content descriptor 659 
country availability descriptor 660 
data broadcasting 658 
data carousels 658 
data piping 658 
data streaming 658 
multiprotocol encapsulation 658 
object carousels 658 
descriptor 659 
announcement support descriptor 
659 
bouquet name descriptor 659 
CA identifier descriptor 659 
cell frequency link descriptor 659 
cell list descriptor 659 
component descriptor 659 
content descriptor 659 
country availability descriptor 660 
frequency list descriptor 660 
linkage descriptor 660 
local time offset descriptor 660 
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multilingual bouquet name de- 
scriptor 660 
multilingual component descriptor 
660 
multilingual network name de- 
scriptor 660 
multilingual service descriptor 660 
network name descriptor 660 
NVOD reference descriptor 660 
parental rating descriptor 660 
service descriptor 660 
service list descriptor 660 
service move descriptor 660 
short event descriptor 660 
subtitling descriptor 660 
teletext descriptor 660 
time shifted service descriptor 660 
VBI descriptor 660 
VBI teletext descriptor 660 
ETSI EN 300 421 654 
ETSI EN 300 429 654 
ETSI EN 300 468 654 
ETSI EN 300 472 655 
ETSI EN 300 744 655 
ETSI EN 301 192 655 
ETSI EN 301 775 655 
ETSI ETS 300 743 655 
event information table 659 
event information tables 659 
frequency list descriptor 660 
helper signals 663 
linkage descriptor 660 
local time offset descriptor 660 
MPEG 2 655 
multilingual bouquet name descriptor 
660 
multilingual component descriptor 660 
multilingual network name descriptor 
660 
multilingual service descriptor 660 
network name descriptor 660 
NVOD reference descriptor 660 
parental rating descriptor 660 


pilot 663 

punctured convolutional codes 662 

running status table 659 

service description table 659 

service descriptor 660 

service information 658 
network information table 659 

service list descriptor 660 

service move descriptor 660 

short event descriptor 660 

subtitles 655 

subtitling descriptor 660 

table 
bouquet association table 659 
event information table 659 
event information tables 659 
network information table 659 
running status table 659 
service description table 659 
time and date table 659 
time offset table 659 

teletext 656 

teletext descriptor 660 

time and date table 659 

time offset table 659 

time shifted service descriptor 660 

VBI data 656 

VBI descriptor 660 

VBI teletext descriptor 660 

video compression 655 

video program system 657 

video resolution 655 

VPS 657 

widescreen signalling 657 

WSS 657 


DV 495 


AAUX 498 

audio 497 

audio auxiliary data 498 
compression 503 

DCT 502 

digital interface 514 

discrete cosine transform 502 
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IEC 61834 495 
IEC 61883 495 
IEEE 1394 514 
macroblocks 502 
SDTI 514 

SMPTE 221M 495 
SMPTE 222M 495 
SMPTE 314M 495 
super block 502 
VAUX 511 

video 502 

video auxiliary data 511 


DVB 654 


announcement support descriptor 659 
audio compression 655 
bouquet association table 659 
bouquet name descriptor 659 
CA identifier descriptor 659 
cell frequency link descriptor 659 
cell list descriptor 659 
closed captioning 656 
COFDM 660 
helper signals 663 
component descriptor 659 
content descriptor 659 
country availability descriptor 660 
data broadcasting 658 
data carousels 658 
data piping 658 
data streaming 658 
multiprotocol encapsulation 658 
object carousels 658 
descriptor 659 
announcement support descriptor 659 
bouquet name descriptor 659 
CA identifier descriptor 659 
cell frequency link descriptor 659 
cell list descriptor 659 
component descriptor 659 
content descriptor 659 
country availability descriptor 660 
frequency list descriptor 660 
linkage descriptor 660 


local time offset descriptor 660 
multilingual bouquet name descriptor 
660 
multilingual component descriptor 660 
multilingual network name descriptor 
660 
multilingual service descriptor 660 
network name descriptor 660 
NVOD reference descriptor 660 
parental rating descriptor 660 
service descriptor 660 
service list descriptor 660 
service move descriptor 660 
short event descriptor 660 
subtitling descriptor 660 
teletext descriptor 660 
time shifted service descriptor 660 
VBI descriptor 660 
VBI teletext descriptor 660 
ETSI EN 300 421 654 
ETSI EN 300 429 654 
ETSI EN 300 468 654 
ETSI EN 300 472 655 
ETSI EN 300 744 655 
ETSI EN 301 192 655 
ETSI EN 301 775 655 
ETSI ETS 300 743 655 
event information table 659 
event information tables 659 
frequency list descriptor 660 
helper signals 663 
linkage descriptor 660 
local time offset descriptor 660 
MPEG 2 655 
multilingual bouquet name descriptor 660 
multilingual component descriptor 660 
multilingual network name descriptor 660 
multilingual service descriptor 660 
network name descriptor 660 
NVOD reference descriptor 660 
parental rating descriptor 660 
pilot 663 
punctured convolutional codes 662 
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running status table 659 
service description table 659 
service descriptor 660 
service information 658 
network information table 659 
service list descriptor 660 
service move descriptor 660 
short event descriptor 660 
subtitles 655 
subtitling descriptor 660 
table 
bouquet association table 659 
event information table 659 
event information tables 659 
network information table 659 
running status table 659 
service description table 659 
time and date table 659 
time offset table 659 
teletext 656 
teletext descriptor 660 
time and date table 659 
time offset table 659 
time shifted service descriptor 660 
VBI data 656 
VBI descriptor 660 
VBI teletext descriptor 660 
video compression 655 
video program system 657 
video resolution 655 
VPS 657 
widescreen signalling 657 
WSS 657 
DVI 160 
DVITC 104 
dynamic rounding 187 


E 

EBU color bars 306 
EBU N10 87 

EI frame 463 

EIA color bars 301 
EIA-189-A 301 


EIA-516 357 

EIA-608 333 

EIA-744 347 

EIA-J CPR-1204 355 
elementary bitstream 564 
EP frame 463 

error feedback rounding 187 
ETSI EN 200 421 654 

ETSI EN 300 163 274 

ETSI EN 300 294 284 

ETSI EN 300 429 654 

ETSI EN 300 468 654 

ETSI EN 300 472 655 

ETSI EN 300 744 655 

ETSI EN 301 192 655 

ETSI EN 301 775 655 

ETSI ETS 300 231 359 
ETSI ETS 300 706 357 
ETSI ETS 300 731 284 
ETSI ETS 300 732 284 
ETSI ETS 300 743 655 
euroconnector 67 

event information table 659 
event information tables 650, 659 


extended channel name descriptor 650 


extended text tables 650 


F 
F control signal 37 
F timing 94 
field identification 396, 427 
NTSC 246 
PAL 268 
SECAM 290 
field prediction 602 
field square wave test signal 314 
field syne 654 
fixed colorplus 286 
flesh tone correction 414 
forward prediction 526, 567 
fractional ratio interpolation 230 
frame dropping 219 
frame duplicating 219 
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frame prediction 602 

frame rate conversion 219 
frame dropping 219 
frame duplication 219 
motion compensation 221 
temporal interpolation 220 

frequency list descriptor 660 


G 
gamma 32 

HDTV 33 

NTSC 33 

PAL 33 

SECAM 33 
gamma correction 32 
garbage matte 209 
genlocking 403, 418 
ghost cancellation 367 
gigabit video interface 168 
graphics overlay 196 
GVIF 168 


H 

H control signal 37 

H tilt 403, 447 

H timing 94 

H.261 448 
block layer 458 
GOB layer 455 
group of blocks layer 455 
I frame 448 
inter frame 448 
intra frame 448 
macroblock layer 456 
motion compensation 451 
P frame 448 
picture layer 454 
predicted frame 448 
prediction 451 
quantization 453 
video bitstream 454 

block layer 458 
GOB layer 455 


group of blocks layer 455 
macroblock layer 456 
picture layer 454 

zig-zag scan 458 


H.263 463 


advanced intra coding mode 489 
advanced prediction mode 489 
alternative inter VLC mode 493 

B frame 463 

bidirectional frame 463 

block layer 476 

coding algorithm 464 

coding control 464 

continuous presence multipoint 489 
CPM 489 

deblocking filter mode 490 

EI frame 463 

EP frame 463 

forced updating 466 

forward error correction mode 489 
GOB layer 469 

group of blocks layer 469 

I frame 463 

improved PB frame 463 

improved PB frames mode 491 


independent segment decoding mode 493 


inter frame 463 

intra frame 463 

levels 494 

macroblock layer 470 

modified quantization mode 493 
motion compensation 464 

P frame 463 

PB frame 463 

PB frames mode 489 

picture layer 466 

plustype picture layer option 483 
predicted frame 463 

prediction 464 

quantization 464 

reduced resolution update mode 493 
reference picture resampling mode 492 
reference picture selection mode 491 
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slice structured mode 490 

SNR scalability mode 491 

spatial scalabilty mode 491 

supplemental enhancement information 
490 


syntax-based arithmetic coding mode 488 


temporal scalability mode 491 
unrestricted motion vector mode 487 
version 2 489 
video bitstream 466 
block layer 476 
GOB layer 469 
group of blocks layer 469 
macroblock layer 470 
picture layer 466 
zig-zag scan 490 
H.263+ 463 
H.263++ 494 
hanging dots 439 
Hanover bars 410 
HAVi 176 
HD-SDTI 144 
HDTV 
ATSC 644 
DVB 654 
gamma 33 
helper signals 654, 663 
8-VSB 654 
COFDM 663 
field sync 654 
pilot 654 
segment sync 654 
high data-rate serial data transport interface 
144 
high profile 564 
HLS 25 
house sync 388 
HSI 25 
color bars 26 
color space 25 
HSV 25 
color bars 26 
color space 25 


hue accuracy 402, 446 
hue adjustment 192 


I 


I frame 448, 463, 523 
I pictures 565 

IEC 60933-1 67, 720 
IEC 60933-2 67, 720 
IEC 60933-5 67, 727 
IEC 61834 176, 495 
IEC 61880 355 

IEC 61883 176, 495 
IEEE 1394 170 


asynchronous data 172 

bus manager nodes 171 

cycle master nodes 171 

digital camera specification 183 
digital transmission content protection 174 
DTCP 174 

DV 495 

DV transfers 176 

endian issues 170 

HAVi 176 

IEC 61834 176 

IEC 61883 176, 495 
isochronous data 173 
isochronous nodes 171 

link layer 172 

network topology 170 

node ports 171 

node types 171 

OHCI 176 

open host controller interface 176 
physical layer 172 

SBP-2 176 

serial bus protocol 176 

SMPTE 314M 176 
specifications 170 

transaction nodes 171 


improved PB frame 463 

inter frame 448, 463, 524 
inter picture 565 

inter-field Y/C separation 440 
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interlaced-to-noninterlaced conversion 230 


intra frame 448, 463, 523 
intra pictures 565 
intra-field Y/C separation 433 
inverse telecine 233 

IP over VBI 366 

IPVBI 366 

ISDB 645 

ISO/IEC 10918 519 
ISO/IEC 11172 519 
ISO/IEC 13818 557 
ITU multiburst test signal 312 
ITU-R BS.707 271 
ITU-R BT.1119 284, 352 
ITU-R BT.1124 367 
ITU-R BT.1197 284 
ITU-R BT.1303 116 
ITU-R BT.1304 104 
ITU-R BT.1305 99 
ITU-R BT.1358 43, 51 
ITU-R BT.1362 119 
ITU-R BT.1364 94, 134 
ITU-R BT.1366 99 
ITU-R BT.1381 140 
ITU-R BT.470 447 
ITU-R BT.471 297 
ITU-R BT.473 317 
ITU-R BT.601 35, 39 
ITU-R BT.653 357 
ITU-R BT.656 108 
ITU-R BT.709 57, 62 
ITU-R BT.799 116 
ITU-R BT.809 359 


J 
jam sync 322 


K 
keying 203 
chroma 206 
composite chroma keying 214 
luminance 203 
superblack 214 


L 
levels 558 
high 1440 level 558 
high level 558 
low level 558 
main level 558 
line bar test signal 312 
linear interpolation 216 
linear keying 206 
linkage descriptor 660 
local time offset descriptor 660 
luminance keying 203 
luminance modulation 207 
luminance nonlinearity 401, 443 


M 
M/PAL 282 
macroblock 234 

DV 502 

MPEG 1 526 

MPEG 2 567 
main profile 558 
master guide table 649 
meander gate 396 
MII interface 87 
mixing 196 
MJPEG 520 
modulated pedestal test signal 310 
modulated ramp test signal 309 
modulated staircase test signal 310 
motion adaptive colorplus 286 
motion adaptive deinterlacing 232 
motion adaptive Y/C separation 440 
motion compensated deinterlacing 233 
motion compensation 525, 566, 602 

frame rate conversion 221 
motion JPEG 520 
MPEG 1 519 

audio 521 

background theory 522 
sound quality 521 
B frame 524 
backward prediction 526 
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bidirectional frame 524 

bidirectional prediction 526 

block 526 

block layer 542 

closed GOP 525 

coded frame types 523 

constrained parameters bitstream 522 

CPB 522 

D frame 530 

DCT 526 

decode postprocessing 555 

decoding video 555 

encode preprocessing 523 

fast playback 555 

forward prediction 526 

group of pictures (GOP) 524 

group of pictures layer 534 

I frame 523 

inter frame 524 

interlaced video 523 

intra frame 523 

ISO/IEC 11172 519 

ISO/IEC 11172 layer 550 

macroblock 526 

macroblock layer 538 

motion compensation 525 

open GOP 525 

P frame 524 

pack layer 550 

packet layer 553 

pause mode 555 

picture layer 535 

postprocessing 555 

predicted frame 524 

preprocessing 523 

quality issues 520 

quantizing 527 

reverse playback 555 

sequence header 531 

slice layer 537 

system bitstream 550 
ISO/IEC 11172 layer 550 
pack layer 550 


packet layer 553 
system header 551 
system header 551 
timecode 556 
variable bit rate 556 
video 522 
video bitstream 531 
block layer 542 
group of pictures layer 534 
macroblock layer 538 
picture layer 535 
sequence header 531 
slice layer 537 
video sequence 531 
video sequence 531 
zig-zag scan 527 
MPEG 2 557 
4-2-2 profile 558 
audio 558 
audio/video synchronization 639 
B pictures 565 
backward prediction 567 
bidirectional pictures 565 
bidirectional prediction 567 
bit rates 563 
block layer 601 
closed GOP 565 


constrained parameters bitstream 558 


CPB 558 

decoding video 639 

descriptor 632 
audio stream 636 
copyright 632 
data alignment 632 
hierarchy 637 
language 634 
maximum bitrate 638 
multiplex buffer utilization 634 
private data 635 
registration 633 
system clock 634 
target background grid 633 
video stream 635 
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video window 637 
elementary bitstream 564 
field prediction 602 
forward prediction 567 
frame prediction 602 
GOP layer 583 
group of pictures (GOP) 565 
group of pictures layer 583 
high level 558 
high profile 564 
I pictures 565 
inter picture 565 
intra pictures 565 
ISO/IEC 13818 557 
levels 558 

high 1440 level 558 

high level 558 

low level 558 

main level 558 
low level 558 
macroblock 567 
macroblock layer 593 
main level 558 
main profile 558 
motion compensation 566, 602 

field prediction 602 

frame prediction 602 
multiview profile 558 
open GOP 565 
P pictures 565 
pack layer 619 
packet layer 623 
PCR 639 
PES packet 624 
picture coding extension 584 
picture display extension 589 
picture layer 583 
picture spatial scalable extension 590 
predicted pictures 565 
private data 638 
profiles 

4-2-2 profile 558 

high profile 564 


main profile 558 
multiview profile 558 
simple profile 558 
SNR profile 564 
spatial profile 564 
studio profile 558 
program reference clock 639 
program stream 619, 623 
pack layer 619 
private data 638 
program stream map 621 
system header 620 
program stream map 621 
quant matrix extension 588 
quantizing 569 
scalability 564 
data partitioning 564 
SNR scalability 564 
spatial scalability 564 
temporal scalability 564 
sequence display extension 577 
sequence extension 576 
sequence header 573 
sequence scalable extension 579 
simple profile 558 
slice layer 592 
SNR profile 564 
spatial profile 564 
studio profile 558 
system header 620 
transport stream 623 
packet layer 623 
user data 576 
video bitstream 571 
block layer 601 
GOP layer 583 
group of pictures layer 583 
macroblock layer 593 
picture coding extension 584 
picture display extension 589 
picture layer 583 


picture spatial scalable extension 590 


quant matrix extension 588 
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sequence display extension 577 
sequence extension 576 
sequence header 573 
sequence scalable extension 579 
slice layer 592 
user data 576 
video sequence 573 
video sequence 573 
zig-zag scan 569 
MPEG 2.5 558 
multiburst test signal 312 
multilingual bouquet name descriptor 660 
multilingual component descriptor 660 
multilingual network name descriptor 660 
multilingual service descriptor 660 
multiprotocol encapsulation 658 
multipulse test signal 312 
multiview profile 558 


N 
N/PAL 282 
NABTS 357 
network name descriptor 660 
NICAM 728 274 
noncomplementary filtering 429 
noninterlaced 

NTSC 248 

PAL 271 
noninterlaced-to-interlaced conversion 228 
NTC-7 combination test signal 317 
NTC-7 composite video test signal 314 
NTSC 239 

4-field sequence 246 

channel assignments 250 

closed captioning 332 

countries 250 

formats 247 

IP over VBI 366 

noninterlaced 248 

overview 239 

RF modulation 247 

teletext 357 

timecode 321 


VBI data 321 
widescreen signalling 352 
WSS 

see widescreen signalling 


NTSC decoding 404 


10-step staircase test signal 309 

12.5T pulse 320 

25T pulse 320 

2D comb filter 433 

2T pulse 320 

3D comb filter 440 

adaptive comb filter 439 

alpha 440 

alpha channel 440 

auto detect 428 

automatic gain control 406 

BT.470 447 

chrominance demodulation 407 

chrominance nonlinear gain 446 

chrominance nonlinear phase 446 

chrominance-luminance intermodulation 
446 

color bars test signal 296 

color burst detection 415 

color saturation accuracy 446 

comb filter 433 

combination test signal 317 

complementary filtering 429 

composite test signal 314 

composite video digitizing 404 

cross-color 428 

cross-luminance 428 

deinterlacing 230 

differential gain 443 

differential phase 443 

EIA color bars 301 

field identification 427 

field signal 426 

field square wave test signal 314 

filtering 410 

flesh tone correction 414 

gamma 33 

genlocking 418 
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H tilt 447 
horizontal blanking 426 
horizontal sync 426 
hue accuracy 446 
hue adjustment 414 
inter-field comb filter 440 
interlaced-to-noninterlaced conversion 230 
intra-field comb filter 433 
ITU-R BT.470 447 
line bar test signal 312 
luminance nonlinearity 443 
modulated pedestal test signal 310 
modulated ramp test signal 309 
modulated staircase test signal 310 
motion adaptive Y/C separation 440 
multiburst test signal 312 
multipulse test signal 312 
NTC-7 combination test signal 317 
NTC-7 composite test signal 314 
PLUGE test signal 307 
progressive scan conversion 230 
red field test signal 309 
reverse blue bars test signal 306 
SMPTE bars test signal 306 
subcarrier generation 423 
subcarrier locking 423 
s-video connector 67 
T pulse 320 
V tilt 447 
vertical blanking 427 
vertical syne 426 
video parameters 443 
chrominance nonlinear gain 446 
chrominance nonlinear phase 446 
chrominance-luminance intermodula- 
tion 446 
color saturation accuracy 446 
differential gain 443 
differential luminance 443 
differential phase 443 
H tilt 447 
hue accuracy 446 
luminance nonlinearity 443 


V tilt 447 

video test signals 296 
10-step staircase 309 
12.5T pulse 320 
25T pulse 320 
2T pulse 320 
color bars 296 
combination 317 
composite 314 
EIA color bars 301 
field square wave 314 
line bar 312 
modulated pedestal 310 
modulated ramp 309 
modulated staircase 310 
multiburst 312 
multipulse 312 
NTC-7 composite 314 
PLUGE 307 
red field 309 
reverse blue bars 306 
SMPTE bars 306 
T pulse 320 
Y bars 308 

video timing generation 426 

Y bars test signal 308 

Y/C connector 67 

Y/C separation 428 
2D comb filter 433 
3D comb filter 440 
adaptive comb filter 439 
comb filter 433 
complementary filtering 429 
inter-field comb filter 440 
intra-field comb filter 433 
motion adaptive 440 
noncomplementary filtering 429 
simple 429 

NTSC encoding 371 

10-step staircase test signal 309 

12.5T pulse 320 

25T pulse 320 

2T pulse 320 
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alpha 404 

alpha channel 404 

bandwidth-limited edge generation 398 

black burst 388 

BT.470 447 

burst generation 384 

chrominance frequency spectra 380 

chrominance nonlinear gain 401 

chrominance nonlinear phase 401 

chrominance-luminance intermodulation 
401 

clean encoding 397 

color bars test signal 296 

color saturation accuracy 402 

combination test signal 317 

composite test signal 314 

composite video generation 386 

differential gain 401 

differential luminance 401 

differential phase 399 

EIA chrominance color bars 385 

EIA color bars 301 

field identification 396 

field square wave test signal 314 

filtering 378 

gamma 33 

genlocking 403 

H tilt 403 

horizontal timing 393 

house sync 388 

hue accuracy 402 

ITU-R BT.470 447 

line bar test signal 312 

luminance generation 375 

luminance nonlinearity 401 

modulated pedestal test signal 310 

modulated ramp test signal 309 

modulated staircase test signal 310 

multiburst test signal 312 

multipulse test signal 312 

noninterlaced-to-interlaced conversion 228 

NTC-7 combination test signal 317 

NTC-7 composite test signal 314 


PLUGE test signal 307 
red field test signal 309 
residual subcarrier 402 
reverse blue bars test signal 306 
SCH phase 402 
SMPTE bars test signal 306 
subcarrier generation 389 
s-video connector 67 
s-video output skew 403 
T pulse 320 
V tilt 403 
vertical timing 393 
video levels 
chrominance (C) 385 
composite 386 
luminance (Y) 377 
video parameters 399 
chrominance nonlinear gain 401 
chrominance nonlinear phase 401 
chrominance-luminance intermodula- 
tion 401 
color saturation accuracy 402 
differential gain 401 
differential luminance 401 
differential phase 399 
H tilt 403 
hue accuracy 402 
luminance nonlinearity 401 
residual subcarrier 402 
SCH phase 402 
V tilt 403 
Y/C output skew 403 
video test signals 296 
10-step staircase 309 
12.5T pulse 320 
25T pulse 320 
2T pulse 320 
color bars 296 
combination 317 
composite 314 
EIA color bars 301 
field square wave 314 
line bar 312 


Index 


751 


modulated pedestal 310 
modulated ramp 309 
modulated staircase 310 
multiburst 312 
multipulse 312 
NTC-7 composite 314 
PLUGE 307 
red field 309 
reverse blue bars 306 
SMPTE bars 306 
T pulse 320 
Y bars 308 

Y bars test signal 308 

Y/C connector 67 

Y/C output skew 403 

NVOD reference descriptor 660 


O 

object carousels 658 

OHCI 176 

open GOP 525, 565 

open host controller interface 176 
OpenCable 645 

openLDI 166 

openLVDS 166 

oversampled VBI data 366 


P 

P frame 448, 463, 524 

P pictures 565 

PAL 262 
channel assignments 277 
closed captioning 350 
countries 282 
formats 272 
NICAM 728 274 
noninterlaced 271 
overview 262 
RF modulation 266 
teletext 357 
timecode 321 
VBI data 321 
widescreen signalling 352 


WSS 
see widescreen signalling 


PAL decoding 404 


10-step staircase test signal 309 
10T pulse 320 

20T pulse 320 

2D comb filter 433 

2T pulse 320 

3D comb filter 440 

adaptive comb filter 439 

alpha 440 

alpha channel 440 

auto detect 428 

automatic gain control 406 
BT.470 447 

chrominance demodulation 407 
chrominance nonlinear gain 446 
chrominance nonlinear phase 446 


chrominance-luminance intermodulation 


446 
color bars test signal 296 
color burst detection 415 
color saturation accuracy 446 
comb filter 433 
complementary filtering 429 
composite video digitizing 404 
cross-color 428 
cross-luminance 428 
deinterlacing 230 
differential gain 443 
differential phase 443 
EBU color bars 306 
euroconnector 67 
field identification 428 
field signal 426 
field square wave test signal 314 
filtering 410 
flesh tone correction 414 
gamma 33 
genlocking 418 
H tilt 447 
Hanover bars 410 
horizontal blanking 426 
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horizontal sync 426 
hue accuracy 446 
hue adjustment 414 
inter-field comb filter 440 
interlaced-to-noninterlaced conversion 230 
intra-field comb filter 433 
ITU multiburst test signal 312 
ITU-R BT.470 447 
line bar test signal 312 
luminance nonlinearity 443 
modulated pedestal test signal 310 
modulated ramp test signal 309 
modulated staircase test signal 310 
motion adaptive Y/C separation 440 
multiburst test signal 312 
multipulse test signal 312 
PAL delay line 431 
PAL modifier 431 
PAL switch 426 
peritel connector 67 
PLUGE test signal 307 
progressive scan conversion 230 
red field test signal 309 
reverse blue bars test signal 306 
SCART connector 67 
simple PAL decoder 431 
subcarrier generation 423 
subcarrier locking 423 
s-video connector 67 
T pulse 320 
V tilt 447 
vertical blanking 427 
vertical sync 426 
video parameters 443 
chrominance nonlinear gain 446 
chrominance nonlinear phase 446 
chrominance-luminance intermodula- 
tion 446 
color saturation accuracy 446 
differential gain 443 
differential luminance 443 
differential phase 443 
H tilt 447 


hue accuracy 446 
luminance nonlinearity 443 
V tilt 447 

video test signals 296 
10-step staircase 309 
10T pulse 320 
20T pulse 320 
2T pulse 320 
color bars 296 
combination 317 
composite 314 
EBU color bars 306 
field square wave 314 
ITU multiburst 312 
line bar 312 
modulated pedestal 310 
modulated ramp 309 
modulated staircase 310 
multiburst 312 
multipulse 312 
PLUGE 307 
red field 309 
reverse blue bars 306 
T pulse 320 
Y bars 308 

video timing generation 426 

Y bars test signal 308 

Y/C connector 67 

Y/C separation 428 
2D comb filter 433 
3D comb filter 440 
adaptive comb filter 439 
comb filter 433 
complementary filtering 429 
inter-field comb filter 440 
intra-field comb filter 433 
motion adaptive 440 
noncomplementary filtering 429 
PAL delay line 431 
PAL modifier 431 
simple 429 

PAL delay line 431 
PAL encoding 371 
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10-step staircase test signal 309 

10T pulse 320 

20T pulse 320 

2T pulse 320 

alpha 404 

alpha channel 404 

bandwidth-limited edge generation 398 

black burst 388 

Bruch blanking 396 

BT.470 447 

burst generation 384 

chrominance frequency spectra 381 

chrominance nonlinear gain 401 

chrominance nonlinear phase 401 

chrominance-luminance intermodulation 
401 

clean encoding 397 

color bars test signal 296 

color saturation accuracy 402 

composite video generation 386 

differential gain 401 

differential luminance 401 

differential phase 399 

EBU chrominance color bars 385 

EBU color bars 306 

euroconnector 67 

field identification 396 

field square wave test signal 314 

filtering 378 

gamma 33 

genlocking 403 

H tilt 403 

horizontal timing 393 

house sync 388 

hue accuracy 402 

ITU multiburst test signal 312 

ITU-R BT.470 447 

line bar test signal 312 

luminance generation 375 

luminance nonlinearity 401 

meander gate 396 

modulated pedestal test signal 310 

modulated ramp test signal 309 


modulated staircase test signal 310 
multiburst test signal 312 
multipulse test signal 312 
noninterlaced-to-interlaced conversion 228 
PAL switch 389 
peritel connector 67 
PLUGE test signal 307 
red field test signal 309 
residual subcarrier 402 
reverse blue bars test signal 306 
SCART connector 67 
SCH phase 402 
subcarrier generation 389 
s-video connector 67 
s-video output skew 403 
T pulse 320 
V tilt 403 
vertical timing 393 
video levels 
chrominance (C) 385 
composite 387 
luminance (Y) 377 
video parameters 399 
chrominance nonlinear gain 401 
chrominance nonlinear phase 401 
chrominance-luminance intermodula- 
tion 401 
color saturation accuracy 402 
differential gain 401 
differential luminance 401 
differential phase 399 
H tilt 403 
hue accuracy 402 
luminance nonlinearity 401 
residual subcarrier 402 
SCH phase 402 
V tilt 403 
Y/C output skew 403 
video test signals 296 
10-step staircase 309 
10T pulse 320 
20T pulse 320 
2T pulse 320 
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color bars 296 
combination 317 
composite 314 

EBU color bars 306 
field square wave 314 
ITU multiburst 312 
line bar 312 


modulated pedestal 310 


modulated ramp 309 


modulated staircase 310 


multiburst 312 
multipulse 312 
PLUGE 307 
red field 309 
reverse blue bars 306 
T pulse 320 
Y bars 308 
Y bars test signal 308 
Y/C connector 67 
Y/C output skew 403 
PAL modifier 431 
PAL switch 389, 426 
PALplus 284 
parental rating descriptor 660 
PB frame 463 
improved 463 
PDC 359 
peaking filter 194 
peritel connector 67 
PhotoYCC 24 
PLUGE test signal 306 
predicted frame 448, 463, 524 
predicted pictures 565 
profiles 
4-2-2 profile 558 
high profile 564 
main profile 558 
multiview profile 558 
simple profile 558 
SNR profile 564 
spatial profile 564 
studio profile 558 


program and system information protocol 648 


program delivery control 359 
program stream 619 

progressive DCT 520 

progressive scan conversion 230 
PSIP 648 

punctured convolutional codes 662 


Q 
QSIF 46 
square pixel 46 
quantization 236 
quantizing 236 
H.261 453 
H.263 464 
MPEG 1 527 
MPEG 2 569 


R 
raw VBI data 366 
red field test signal 309 
Reed-Solomon encoding 653, 661 
residual subcarrier 402 
reverse blue bars test signal 306 
RF modulation 
NTSC 247 
PAL 266 
RGB 
color bars 16 
color space 15 
RGB interface 
HDTV 75 
digitization 77 
generation 75 
SDTV 71 
0 IRE blanking pedestal 74 
7.5 IRE blanking pedestal 71 
digitization 74, 75 
generation 71, 74 
VGA 86 


Index 


755 


rounding 
conventional 187 
dynamic 187 
error feedback 187 
truncation 187 
run length coding 236 
running status table 659 


Ss 
saturation adjustment 192 
SBP-2 176 
scalability 
data partioning 564 
SNR scalability 564 
spatial scalability 564 
temporal scalability 564 
scaling 215 
anti-aliased resampling 216 
bilinear interpolation 216 
Bresenham algorithm 216 
linear interpolation 216 
pixel dropping 216 
pixel duplication 216 
scan conversion 
interlaced to noninterlaced 230 
noninterlaced to interlaced 228 
SCART connector 67 
SCH phase 133, 402 
SDI 113 
SDTI 140 
DV 495 
SMPTE 221M 495 
SMPTE 222M 495 
SDTV 
ATSC 644 
DVB 654 
SECAM 287 
4-field sequence 290 
countries 289 
formats 291 
gamma 33 
overview 287 
segment sync 654 


sequential DCT 519 
serial bus protocol 176 
serial data transport interface 140 
service description table 659 
service descriptor 660 
service information 658 

network information table 659 
service list descriptor 660 
service move descriptor 660 
shadow chroma keying 207 
sharpness control 194 
short event descriptor 660 
SIF 43 

square pixel 43 
simple profile 558 
sine-squared pulse 320, 398 
sliced VBI data 367 
SMPTE 125M 116 
SMPTE 12M 321 
SMPTE 221M 495 
SMPTE 222M 495 
SMPTE 244M 134 
SMPTE 259M 113 
SMPTE 262M 329 
SMPTE 266M 104 
SMPTE 267M 39, 116 
SMPTE 272M 99 
SMPTE 274M 57, 62, 112 
SMPTE 291M 94, 134 
SMPTE 292M 115 
SMPTE 293M 43 
SMPTE 296M 54 
SMPTE 305M 140 
SMPTE 309M 329 
SMPTE 314M 176, 495 
SMPTE 334M 104 
SMPTE 344M 115 
SMPTE 348M 144 
SMPTE bars test signal 306 
SNR scalability 564 
spatial profile 564 
spatial scalability 564 
spectrum locus 26 
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studio profile 558 
stuffing descriptor 651 
subcarrier generation 389, 423 
subcarrier locking 423 
subtitles 655 
subtitling descriptor 660 
super block 502 
superblack keying 214 
S-video 376, 384 
s-video interface 66 
extended 67 
s-video output skew 403 
system bitstream 550 
system time table 649 


T 

T pulse 320, 398 

T step 398 

table 
bouquet association table 659 
event information table 659 
network information table 659 
running status table 659 
service description table 659 
time and date table 659 
time offset table 659 

teletext 357, 656 

teletext descriptor 660 

temporal interpolation 220 

temporal rate conversion 219 

temporal scalability 564 

text overlay 196 


three-level chrominance bar test signal 310 


time and date table 659 
time offset table 659 
time shifted service descriptor 660 
timecode 321 
ancillary data 99 
drop frame 322 
jam sync 322 
longitudinal timecode 322 
LTC 322 
user data 330 


vertical interval timecode 325 
VITC 325 
time-shifted service descriptor 651 
transport interfaces 140 
HD-SDTI 144 


high data-rate serial data transport inter- 


face 144 
IEEE 1394 170 
ITU-R BT.1381 140 
SDTI 140 


serial data transport interface 140 


SMPTE 305M 140 

SMPTE 348M 144 
transport stream 623 
Trellis encode 653 
tristimulus values 26 


U 

user controls 192 
brightness adjustment 192 
contrast adjustment 192 
hue adjustment 192 
saturation adjustment 192 
sharpness adjustment 194 


Vv 
v chip 346 
V control signal 37 
V tilt 403, 447 
V timing 94 
variable interpolation 230 
varispeed 228 
VBI data 321 
ancillary data 94 
closed captioning 104, 332 
ghost cancellation 367 
NABTS 357 
oversampled 366 
raw 366 
sliced 367 
teletext 357 
timecode 99, 321 
vertical interval timecode 325 
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VITC 325 
widescreen signalling 352 
WSS 
see widescreen signalling 
VBI descriptor 660 
VBI teletext descriptor 660 
vertical interval timecode 325 
vertical timing 393 
VGA connector 86 
video bitstream 
H.261 454 
H.263 466 
MPEG 1531 
MPEG 2571 
video conferencing 448 
video index 105 
ancillary data 105 
video interface - analog 66 
Betacam 86 
EBU N10 86 
euroconnector 67 
MII 86 
peritel 67 
RGB HDTV 75 
RGB SDTV 71 
SCART 67 
SMPTE 86 
s-video 66 
VGA 86 
YPbPr HDTV 82 
YPbPr SDTV 77 
video interface - digital - consumer 160 
DFP 164 
digital flat panel interface 164 
digital visual interface 160 
DVI 160 
gigabit video interface 168 
GVIF 168 
open LVDS display interface 166 
openLDI 166 
video interface - digital - IC 147 
BT.656 154 
standard video interface 147 


video interface port 156 
video module interface 152 
VIP 156 

VMI 152 


video interface - digital - pro video 92 


BT.1120 112 
BT.1302 110, 115 
BT.1303 116 
BT.1362 119 
BT.656 108, 113 
BT.799 116 

SDI 113 

SMPTE 125M 108 
SMPTE 259M 113 
SMPTE 267M 110 
SMPTE 274M 112 
SMPTE 292M 115 
SMPTE 294M 119 
SMPTE 344M 115 


video interface port 156 
video levels 


NTSC 
chrominance (C) 385 
composite 386 
luminance (Y) 377 
PAL 
chrominance (C) 385 
composite 387 
luminance (Y) 377 
RGB 71 
YPbPr 77 


video mixing 196 
video module interface 152 
video parameters 399, 443 


chrominance nonlinear gain 401, 446 


chrominance nonlinear phase 401, 446 


chrominance—to—luminance intermodula- 


tion 401 
color saturation accuracy 402, 446 
differential gain 401, 443 
differential luminance 401, 443 
differential phase 399, 443 
H tilt 403, 447 
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hue accuracy 402, 446 
luminance nonlinearity 401, 443 
residual subcarrier 402 

SCH phase 402 

V tilt 403, 447 

Y/C output skew 403 


video processing 186 


alpha mixing 196 

brightness adjustment 192 
chroma keying 206 

color transient improvement 194 
composite chroma keying 214 
contrast adjustment 192 

coring 194 

deinterlacing 230 

frame rate conversion 219 
graphics overlay 196 

hue adjustment 192 

keying 203 

luminance keying 203 
noninterlaced-to-interlaced conversion 228 
peaking filter 194 


20T pulse 320 

25T pulse 320 

2T pulse 320 

color bars 296 
combination 317 
composite 314 

EBU color bars 306 
EIA color bars 301 

field square wave 314 
ITU multiburst 312 

line bar 312 

modulated pedestal 310 
modulated ramp 309 
modulated staircase 310 
multiburst 312 
multipulse 312 

NTC-7 combination 317 
NTC-7 composite 314 
PLUGE 307 

red field 309 

reverse blue bars 306 
sine-squared pulse 320 


rounding 187 SMPTE bars 306 
saturation adjustment 192 T pulse 320 
scaling 215 three-level chrominance bar 310 
anti-aliased resampling 216 Y bars 308 
bilinear interpolation 216 video timing 293 
linear interpolation 216 VIP 156 
pixel dropping 216 virtual channel table 650 
pixel duplication 216 VITC 325 
sharpness 194 VMI 152 
text overlay 196 VPS 362, 657 
video mixing 196 VSB 652 
video program system 657 helper signals 654 
video programming system 362 
video test signals 296 WwW 
color bars wide keying 214 
NTSC 300 widescreen signalling 352, 657 
PAL 303 EIA-J CPR-1204 355 
NTSC/PAL ETSI EN 300 294 352 
10-step staircase 309 IEC 61880 355 
10T pulse 320 ITU-R BT.1119 352 
12.5T pulse 320 PALplus 284 
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SCART 68 
s-video 67 
WSS 
see widescreen signalling 


Y 

Y bars test signal 308 

Y/C output skew 403 

Y/C separation 428 
2D comb filter 433 
3D comb filter 440 
adaptive comb filter 439 
comb filter 433 
complementary filtering 429 
inter-field 440 
intra-field 433 
motion adaptive 440 
noncomplementary filtering 429 
PAL delay line 431 
PAL modifier 431 
simple 429 

Y/C video 376, 384 

YCbCr 17 
4-1-1 format 20 
4-2-0 format 20 
4-2-2 format 19 
4-4-4 format 19 
4-4-4 to 4-2-2 189 
color bars 18 
color space 17 


YIQ 17 
YPbPr 77 
YPbPr analog video 82 
YPbPr interface 
Betacam 87 
EBU N10 87 
HDTV 82 
digitization 86 
generation 82 
MII 87 
SDTV 77 
digitization 81 
generation 81 
SMPTE 87 
YUV 16 
YUV12 20 


Z 

zero SCH phase 133 

zig-zag scan 236 
H.261 458 
H.263 490 
MPEG 1 527 
MPEG 2 569 

zweiton 271 
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1-878707-16-7 $49.95 


Digital Frequency Synthesis Demystified 

by Bar-Giora Goldberg 

INCLUDES WINDOWS CD-ROM. An 
essential reference for electronics engineers 
covering direct digital synthesis (DDS) and PLL 
frequency synthesis. The accompanying CD- 
ROM contains useful design tools and ex- 
amples, and a DDS tutorial. 

1-878707-47-7 $49.95 


Bebop to the Boolean Boogie 

An Unconventional Guide to Electronics 
Fundamentals, Components, and Processes 

by Clive “Max” Maxfield 

The essential reference on modern electronics, 
written with wit and style. Worth the price for 
the glossary alone! 

1-878707-22-1 $35.00 


Modeling Engineering Systems 

PC-Based Techniques and Design Tools 

by Jack W. Lewis 

INCLUDES WINDOWS CD-ROM. Teaches the 
fundamentals of math modeling and shows how 
to simulate any engineering system using a PC 
spreadsheet. 

1-878707-08-6 $29.95 


Fibre Channel, Second Edition 

Connection to the Future 

by the Fibre Channel Industry Association 

A concise guide to the fundamentals of the 
popular ANSI Fibre Channel standard for high- 
speed computer interconnection. 

1-878707-45-0 $16.95 
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